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PREFACE  TO  THE  THIRD  EDITION 


No  extensive  changes  have  been  made  in  this  edition.  The  most  impor¬ 
tant  are  in  §§  80-82,  which  I  have  rewritten  in  accordance  with  suggestions 
made  by  Mr  S.  Pollard. 

The  earlier  editions  contained  no  satisfactory  account  of  the  genesis  of 
the  circular  functions.  I  have  made  some  attempt  to  meet  this  objection 
in  §  158  and  Appendix  III.  Appendix  IV  is  also  an  addition. 

It  is  curious  to  note  how  the  character  of  the  criticisms  I  have  had  to 
meet  has  changed.  I  was  too  meticulous  and  pedantic  for  my  pupils  of 
fifteen  years  ago:  I  am  altogether  too  popular  for  the  Trinity  scholar  of 
to-day.  I  need  hardly  say  that  I  find  such  criticisms  very  gratifying,  as  the 
best  evidence  that  the  book  has  to  some  extent  fulfilled  the  purpose  with 
which  it  was  written. 


August  1921 


G.  H.  H. 


EXTRACT  FROM  THE  PREFACE  TO  THE 
SECOND  EDITION 

The  principal  changes  made  in  this  edition  are  as  follows.  I  have  in¬ 
serted  in  Chapter  I  a  sketch  of  Dedekind’s  theory  of  real  numbers,  and  a 
proof  of  Weierstrass’s  theorem  concerning  points  of  condensation;  in  Chap¬ 
ter  IV  an  account  of  ‘limits  of  indetermination ’  and  the  ‘general  principle  of 
convergence’;  in  Chapter  V  a  proof  of  the  ‘Heine-Borel  Theorem’,  Heine’s 
theorem  concerning  uniform  continuity,  and  the  fundamental  theorem  con¬ 
cerning  implicit  functions;  in  Chapter  VI  some  additional  matter  concern¬ 
ing  the  integration  of  algebraical  functions;  and  in  Chapter  VII  a  section 
on  differentials.  I  have  also  rewritten  in  a  more  general  form  the  sections 
which  deal  with  the  definition  of  the  definite  integral.  In  order  to  find 
space  for  these  insertions  I  have  deleted  a  good  deal  of  the  analytical  ge¬ 
ometry  and  formal  trigonometry  contained  in  Chapters  II  and  III  of  the 


first  edition.  These  changes  have  naturally  involved  a  large  number  of 
minor  alterations. 

G.  H.  H. 

October  1914 

EXTRACT  FROM  THE  PREFACE  TO  THE  FIRST 

EDITION 

This  book  has  been  designed  primarily  for  the  use  of  first  year  students 
at  the  Universities  whose  abilities  reach  or  approach  something  like  what  is 
usually  described  as  ‘scholarship  standard’.  I  hope  that  it  may  be  useful  to 
other  classes  of  readers,  but  it  is  this  class  whose  wants  I  have  considered 
first.  It  is  in  any  case  a  book  for  mathematicians:  I  have  nowhere  made 
any  attempt  to  meet  the  needs  of  students  of  engineering  or  indeed  any 
class  of  students  whose  interests  are  not  primarily  mathematical. 

I  regard  the  book  as  being  really  elementary.  There  are  plenty  of  hard 
examples  (mainly  at  the  ends  of  the  chapters):  to  these  I  have  added, 
wherever  space  permitted,  an  outline  of  the  solution.  But  I  have  done  my 
best  to  avoid  the  inclusion  of  anything  that  involves  really  difficult  ideas. 
For  instance,  I  make  no  use  of  the  ‘principle  of  convergence’:  uniform 
convergence,  double  series,  infinite  products,  are  never  alluded  to:  and 
I  prove  no  general  theorems  whatever  concerning  the  inversion  of  limit- 

,92/  d2j 

operations—  I  never  even  define  — — —  and  — — — .  In  the  last  two  chapters  I 

ox  Oy  ay  ox 

have  occasion  once  or  twice  to  integrate  a  power-series,  but  I  have  confined 
myself  to  the  very  simplest  cases  and  given  a  special  discussion  in  each 
instance.  Anyone  who  has  read  this  book  will  be  in  a  position  to  read  with 
profit  Dr  Bromwich’s  Infinite  Series,  where  a  full  and  adequate  discussion 
of  all  these  points  will  be  found. 


September  1908 
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CHAPTER  I 


REAL  VARIABLES 

1.  Rational  numbers.  A  fraction  r  =  p/q,  where  p  and  q  are  pos¬ 
itive  or  negative  integers,  is  called  a  rational  number.  We  can  suppose 
(i)  that  p  and  q  have  no  common  factor,  as  if  they  have  a  common  factor 
we  can  divide  each  of  them  by  it,  and  (ii)  that  q  is  positive,  since 

p/(-q)  =  (~p)/q,  (-p)/{-q)  =  p/q- 

To  the  rational  numbers  thus  defined  we  may  add  the  ‘rational  number  0’ 
obtained  by  taking  p  —  0. 

We  assume  that  the  reader  is  familiar  with  the  ordinary  arithmetical 
rules  for  the  manipulation  of  rational  numbers.  The  examples  which  follow 
demand  no  knowledge  beyond  this. 

Examples  I.  1.  If  r  and  s  are  rational  numbers,  then  r  +  s,  r  —  s,  rs, 
and  r/s  are  rational  numbers,  unless  in  the  last  case  s  =  0  (when  r/s  is  of  course 
meaningless). 

2.  If  A,  m,  and  n  are  positive  rational  numbers,  and  rn  >  n,  then 
A (m2  —  n2),  2A mn,  and  A (m2  +  n2)  are  positive  rational  numbers.  Hence  show 
how  to  determine  any  number  of  right-angled  triangles  the  lengths  of  all  of 
whose  sides  are  rational. 

3.  Any  terminated  decimal  represents  a  rational  number  whose  denomina¬ 
tor  contains  no  factors  other  than  2  or  5.  Conversely,  any  such  rational  number 
can  be  expressed,  and  in  one  way  only,  as  a  terminated  decimal. 

[The  general  theory  of  decimals  will  be  considered  in  Ch.  IV.] 

4.  The  positive  rational  numbers  may  be  arranged  in  the  form  of  a  simple 
series  as  follows: 

1213214321 
1’  1’  2’  1’  2’  3’  1’  2’  3’  4  ’  ■ • • • 

Show  that  p/q  is  the  [\{p  +  q  —  l)(p  +  q  —  2)  +  g]th  term  of  the  series. 

[In  this  series  every  rational  number  is  repeated  indefinitely.  Thus  1  occurs 
as  j ,  | ,  §,••••  We  can  of  course  avoid  this  by  omitting  every  number  which  has 
already  occurred  in  a  simpler  form,  but  then  the  problem  of  determining  the 
precise  position  of  p/q  becomes  more  complicated.] 
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2.  The  representation  of  rational  numbers  by  points  on  a  line. 

It  is  convenient,  in  many  branches  of  mathematical  analysis,  to  make  a 
good  deal  of  use  of  geometrical  illustrations. 

The  use  of  geometrical  illustrations  in  this  way  does  not,  of  course, 
imply  that  analysis  has  any  sort  of  dependence  upon  geometry:  they  are 
illustrations  and  nothing  more,  and  are  employed  merely  for  the  sake  of 
clearness  of  exposition.  This  being  so,  it  is  not  necessary  that  we  should 
attempt  any  logical  analysis  of  the  ordinary  notions  of  elementary  geome¬ 
try;  we  may  be  content  to  suppose,  however  far  it  may  be  from  the  truth, 
that  we  know  what  they  mean. 

Assuming,  then,  that  we  know  what  is  meant  by  a  straight  line,  a 
segment  of  a  line,  and  the  length  of  a  segment,  let  us  take  a  straight  line  A, 
produced  indefinitely  in  both  directions,  and  a  segment  AqA\  of  any  length. 
We  call  Ao  the  origin ,  or  the  point  0,  and  A\  the  point  1,  and  we  regard 
these  points  as  representing  the  numbers  0  and  1. 

In  order  to  obtain  a  point  which  shall  represent  a  positive  rational 
number  r  =  p/q,  we  choose  the  point  Ar  such  that 


AoAr/A0A1  =  r, 


AqAt  being  a  stretch  of  the  line  extending  in  the  same  direction  along  the 
line  as  AoAi,  a  direction  which  we  shall  suppose  to  be  from  left  to  right 
when,  as  in  Fig.  1,  the  line  is  drawn  horizontally  across  the  paper.  In 
order  to  obtain  a  point  to  represent  a  negative  rational  number  r  =  — s, 

- 1 - 1 - 1 - 1 - 1 - 

A-s  A_i  Aq  A\  As 

Fig.  l. 

it  is  natural  to  regard  length  as  a  magnitude  capable  of  sign,  positive  if 
the  length  is  measured  in  one  direction  (that  of  AoAi),  and  negative  if 
measured  in  the  other,  so  that  AB  =  —BA]  and  to  take  as  the  point 
representing  r  the  point  A_s  such  that 

AqA_s  =  — A_sAq  =  — AqAs. 
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We  thus  obtain  a  point  Ar  on  the  line  corresponding  to  every  rational 
value  of  r,  positive  or  negative,  and  such  that 


A A  —  r  ■  AA; 


and  if,  as  is  natural,  we  take  AA  as  our  unit  of  length,  and  write 
A  A  =  1,  then  we  have 

AA  =  t. 

We  shall  call  the  points  A  the  rational  points  of  the  line. 

3.  Irrational  numbers.  If  the  reader  will  mark  off  on  the  line  all 
the  points  corresponding  to  the  rational  numbers  whose  denominators  are 
1,  2,  3, . . .  in  succession,  he  will  readily  convince  himself  that  he  can  cover 
the  line  with  rational  points  as  closely  as  he  likes.  We  can  state  this  more 
precisely  as  follows:  if  we  take  any  segment  BC  on  A  ,  we  can  find  as  many 
rational  points  as  we  please  on  BC. 

Suppose,  for  example,  that  BC  falls  within  the  segment  AA.  It  is 
evident  that  if  we  choose  a  positive  integer  k  so  that 

k-BC>l*  (1) 

and  divide  AA  into  k  equal  parts,  then  at  least  one  of  the  points  of 
division  (say  P)  must  fall  inside  BC,  without  coinciding  with  either  B  or  C. 
For  if  this  were  not  so,  BC  would  be  entirely  included  in  one  of  the  k  parts 
into  which  A  A  has  been  divided,  which  contradicts  the  supposition  (1). 
But  P  obviously  corresponds  to  a  rational  number  whose  denominator  is  k. 
Thus  at  least  one  rational  point  P  lies  between  B  and  C.  But  then  we  can 
find  another  such  point  Q  between  B  and  P,  another  between  B  and  Q, 
and  so  on  indefinitely;  i.e.,  as  we  asserted  above,  we  can  find  as  many  as 
we  please.  We  may  express  this  by  saying  that  BC  includes  infinitely  many 
rational  points. 

*The  assumption  that  this  is  possible  is  equivalent  to  the  assumption  of  what  is 
known  as  the  Axiom  of  Archimedes. 
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The  meaning  of  such  phrases  as  1 infinitely  many ’  or  ‘an  infinity  of\  in  such 
sentences  as  LBC  includes  infinitely  many  rational  points’  or  ‘there  are  an  infinity 
of  rational  points  on  BC'  or  ‘there  are  an  infinity  of  positive  integers’,  will  be 
considered  more  closely  in  Ch.  IV.  The  assertion  ‘there  are  an  infinity  of  positive 
integers’  means  ‘given  any  positive  integer  n,  however  large,  we  can  find  more 
than  n  positive  integers’.  This  is  plainly  true  whatever  n  may  be,  e.g.  for 
n  =  100,000  or  100,000,000.  The  assertion  means  exactly  the  same  as  ‘we  can 
find  as  many  positive  integers  as  we  please\ 

The  reader  will  easily  convince  himself  of  the  truth  of  the  following  assertion, 
which  is  substantially  equivalent  to  what  was  proved  in  the  second  paragraph 
of  this  section:  given  any  rational  number  r,  and  any  positive  integer  n,  we  can 
find  another  rational  number  lying  on  either  side  of  r  and  differing  from  r  by 
less  than  1/n.  It  is  merely  to  express  this  differently  to  say  that  we  can  find 
a  rational  number  lying  on  either  side  of  r  and  differing  from  r  by  as  little  as 
we  please.  Again,  given  any  two  rational  numbers  r  and  s,  we  can  interpolate 
between  them  a  chain  of  rational  numbers  in  which  any  two  consecutive  terms 
differ  by  as  little  as  we  please,  that  is  to  say  by  less  than  1/n,  where  n  is  any 
positive  integer  assigned  beforehand. 

From  these  considerations  the  reader  might  be  tempted  to  infer  that  an 
adequate  view  of  the  nature  of  the  line  could  be  obtained  by  imagining  it  to 
be  formed  simply  by  the  rational  points  which  lie  on  it.  And  it  is  certainly 
the  case  that  if  we  imagine  the  line  to  be  made  up  solely  of  the  rational 
points,  and  all  other  points  (if  there  are  any  such)  to  be  eliminated,  the 
figure  which  remained  would  possess  most  of  the  properties  which  common 
sense  attributes  to  the  straight  line,  and  would,  to  put  the  matter  roughly, 
look  and  behave  very  much  like  a  line. 

A  little  further  consideration,  however,  shows  that  this  view  would 
involve  us  in  serious  difficulties. 

Let  us  look  at  the  matter  for  a  moment  with  the  eye  of  common  sense, 
and  consider  some  of  the  properties  which  we  may  reasonably  expect  a 
straight  line  to  possess  if  it  is  to  satisfy  the  idea  which  we  have  formed  of 
it  in  elementary  geometry. 

The  straight  line  must  be  composed  of  points,  and  any  segment  of  it  by 
all  the  points  which  lie  between  its  end  points.  With  any  such  segment 
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must  be  associated  a  certain  entity  called  its  length ,  which  must  be  a 
quantity  capable  of  numerical  measurement  in  terms  of  any  standard  or 
unit  length,  and  these  lengths  must  be  capable  of  combination  with  one 
another,  according  to  the  ordinary  rules  of  algebra,  by  means  of  addition  or 
multiplication.  Again,  it  must  be  possible  to  construct  a  line  whose  length 
is  the  sum  or  product  of  any  two  given  lengths.  If  the  length  PQ,  along 
a  given  line,  is  a,  and  the  length  QR ,  along  the  same  straight  line,  is  b, 
the  length  PR  must  be  a  +  b.  Moreover,  if  the  lengths  OP,  OQ,  along  one 
straight  line,  are  1  and  a,  and  the  length  OR  along  another  straight  line  is  b, 
and  if  we  determine  the  length  OS  by  Euclid’s  construction  (Euc.  VI.  12) 
for  a  fourth  proportional  to  the  lines  OP,  OQ,  OR,  this  length  must  be  ab, 
the  algebraical  fourth  proportional  to  1,  a,  b.  And  it  is  hardly  necessary  to 
remark  that  the  sums  and  products  thus  defined  must  obey  the  ordinary 
‘laws  of  algebra’;  viz. 

a  +  b  =  b  +  a,  a  +  (b  +  c)  =  (a  +  b)  +  c, 
ab  =  ba,  a{bc )  =  ( ab)c ,  a[b  +  c)  =  ab  +  ac. 

The  lengths  of  our  lines  must  also  obey  a  number  of  obvious  laws  concerning 
inequalities  as  well  as  equalities:  thus  if  A,  B,  C  are  three  points  lying 
along  A  from  left  to  right,  we  must  have  AB  <  AC ,  and  so  on.  Moreover 
it  must  be  possible,  on  our  fundamental  line  A,  to  find  a  point  P  such 
that  AqP  is  equal  to  any  segment  whatever  taken  along  A  or  along  any 
other  straight  line.  All  these  properties  of  a  line,  and  more,  are  involved 
in  the  presuppositions  of  our  elementary  geometry. 

Now  it  is  very  easy  to  see  that  the  idea  of  a  straight  line  as  composed  of 
a  series  of  points,  each  corresponding  to  a  rational  number,  cannot  possibly 
satisfy  all  these  requirements.  There  are  various  elementary  geometrical 
constructions,  for  example,  which  purport  to  construct  a  length  x  such 
that  x 2  =  2.  For  instance,  we  may  construct  an  isosceles  right-angled  tri¬ 
angle  ABC  such  that  AB  =  AC  =  1.  Then  if  BC  =  x,  x2  =  2.  Or  we  may 
determine  the  length  x  by  means  of  Euclid’s  construction  (Euc.  VI.  13)  for 
a  mean  proportional  to  1  and  2,  as  indicated  in  the  figure.  Our  require¬ 
ments  therefore  involve  the  existence  of  a  length  measured  by  a  number  x, 
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A  1  B  L  2  MIN 

Fig.  2. 

and  a  point  P  on  A  such  that 

A0P  =  x,  x2  =  2. 

But  it  is  easy  to  see  that  there  is  no  rational  number  such  that  its  square 
is  2.  In  fact  we  may  go  further  and  say  that  there  is  no  rational  number 
whose  square  is  m/n ,  where  m/n  is  any  positive  fraction  in  its  lowest  terms, 
unless  m  and  n  are  both  perfect  squares. 

For  suppose,  if  possible,  that 

p2 / q2  =  m/n, 

p  having  no  factor  in  common  with  q,  and  m  no  factor  in  common  with  n. 
Then  np 2  =  mq2.  Every  factor  of  q2  must  divide  np 2 ,  and  as  p  and  q 
have  no  common  factor,  every  factor  of  q 2  must  divide  n.  Hence  n  =  A q2, 
where  A  is  an  integer.  But  this  involves  m  =  A p2:  and  as  m  and  n  have 
no  common  factor,  A  must  be  unity.  Thus  m  —  p2,  n  —  q2,  as  was  to  be 
proved.  In  particular  it  follows,  by  taking  n  —  1,  that  an  integer  cannot 
be  the  square  of  a  rational  number,  unless  that  rational  number  is  itself 
integral. 

It  appears  then  that  our  requirements  involve  the  existence  of  a  num¬ 
ber  x  and  a  point  P ,  not  one  of  the  rational  points  already  constructed, 
such  that  A0P  =  x,  x2  =  2;  and  (as  the  reader  will  remember  from  ele¬ 
mentary  algebra)  we  write  x  =  y/2. 

The  following  alternative  proof  that  no  rational  number  can  have  its  square 
equal  to  2  is  interesting. 
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Suppose,  if  possible,  that  p/q  is  a  positive  fraction,  in  its  lowest  terms,  such 
that  (p/q)2  =  2  or  p2  =  2 q2.  It  is  easy  to  see  that  this  involves  (2 q  —  p)2  = 
2 (p  —  q)2;  and  so  (2 q  —  p)/(p  —  q)  is  another  fraction  having  the  same  property. 
But  clearly  q  <  p  <  2q,  and  so  p  —  q  <  q.  Hence  there  is  another  fraction  equal 
to  p/q  and  having  a  smaller  denominator,  which  contradicts  the  assumption  that 
p/q  is  in  its  lowest  terms. 

Examples  II.  1.  Show  that  no  rational  number  can  have  its  cube  equal 
to  2. 

2.  Prove  generally  that  a  rational  fraction  p/q  in  its  lowest  terms  cannot 
be  the  cube  of  a  rational  number  unless  p  and  q  are  both  perfect  cubes. 

3.  A  more  general  proposition,  which  is  due  to  Gauss  and  includes  those 
which  precede  as  particular  cases,  is  the  following:  an  algebraical  equation 

Xn  +  Pixn~l  +  P2Xn~2  H - b  Pn  =  0, 

with  integral  coefficients,  cannot  have  a  rational  but  non-integral  root. 

[For  suppose  that  the  equation  has  a  root  a/b ,  where  a  and  b  are  integers 
without  a  common  factor,  and  b  is  positive.  Writing  a/b  for  x,  and  multiplying 
by  bn~1,  we  obtain 

n 

=  p\an~l  +  p2an~2b  4 - \-pnbn~l, 

b 

a  fraction  in  its  lowest  terms  equal  to  an  integer,  which  is  absurd.  Thus  6=1, 
and  the  root  is  a.  It  is  evident  that  a  must  be  a  divisor  of  pn.\ 

4.  Show  that  if  pn  =  1  and  neither  of 

1  +  pi  +  p2  +  P3  +  •  •  • ,  1  -  Pi  +  P2  ~  P3  +  •  •  • 

is  zero,  then  the  equation  cannot  have  a  rational  root. 

5.  Find  the  rational  roots  (if  any)  of 

x 4  —  4.x3  —  8x2  +  13x  +  10  =  0. 

[The  roots  can  only  be  integral,  and  so  ±1,  ±2,  ±5,  ±10  are  the  only  possi¬ 
bilities:  whether  these  are  roots  can  be  determined  by  trial.  It  is  clear  that  we 
can  in  this  way  determine  the  rational  roots  of  any  such  equation.] 
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4.  Irrational  numbers  (continued) .  The  result  of  our  geometrical 
representation  of  the  rational  numbers  is  therefore  to  suggest  the  desirabil¬ 
ity  of  enlarging  our  conception  of  ‘number’  by  the  introduction  of  further 
numbers  of  a  new  kind. 

The  same  conclusion  might  have  been  reached  without  the  use  of  ge¬ 
ometrical  language.  One  of  the  central  problems  of  algebra  is  that  of  the 
solution  of  equations,  such  as 

x 2  =  1,  x2  =  2. 

The  first  equation  has  the  two  rational  roots  1  and  —1.  But,  if  our  con¬ 
ception  of  number  is  to  be  limited  to  the  rational  numbers,  we  can  only 
say  that  the  second  equation  has  no  roots;  and  the  same  is  the  case  with 
such  equations  as  a:3  =  2,  xA  =  7.  These  facts  are  plainly  sufficient  to  make 
some  generalisation  of  our  idea  of  number  desirable,  if  it  should  prove  to 
be  possible. 

Let  us  consider  more  closely  the  equation  x2  =  2. 

We  have  already  seen  that  there  is  no  rational  number  x  which  satisfies 
this  equation.  The  square  of  any  rational  number  is  either  less  than  or 
greater  than  2.  We  can  therefore  divide  the  rational  numbers  into  two 
classes,  one  containing  the  numbers  whose  squares  are  less  than  2,  and 
the  other  those  whose  squares  are  greater  than  2.  We  shall  confine  our 
attention  to  the  positive  rational  numbers,  and  we  shall  call  these  two 
classes  the  class  L,  or  the  lower  class,  or  the  left-hand  class,  and  the  class  R, 
or  the  upper  class ,  or  the  right-hand  class.  It  is  obvious  that  every  member 
of  R  is  greater  than  all  the  members  of  L.  Moreover  it  is  easy  to  convince 
ourselves  that  we  can  find  a  member  of  the  class  L  whose  square,  though 
less  than  2,  differs  from  2  by  as  little  as  we  please,  and  a  member  of  R 
whose  square,  though  greater  than  2,  also  differs  from  2  by  as  little  as  we 
please.  In  fact,  if  we  carry  out  the  ordinary  arithmetical  process  for  the 
extraction  of  the  square  root  of  2,  we  obtain  a  series  of  rational  numbers, 
viz. 

1,  1.4,  1.41,  1.414,  1.4142,  ... 

whose  squares 

1,  1.96,  1.9881,  1.999  396,  1.999  96164,  ... 
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are  all  less  than  2,  but  approach  nearer  and  nearer  to  it;  and  by  taking  a 
sufficient  number  of  the  figures  given  by  the  process  we  can  obtain  as  close 
an  approximation  as  we  want.  And  if  we  increase  the  last  figure,  in  each 
of  the  approximations  given  above,  by  unity,  we  obtain  a  series  of  rational 
numbers 

2,  1.5,  1.42,  1.415,  1.4143,  ... 

whose  squares 

4,  2.25,  2.0164,  2.002  225,  2.000  24449,  ... 

are  all  greater  than  2  but  approximate  to  2  as  closely  as  we  please. 

The  reasoning  which  precedes,  although  it  will  probably  convince  the  reader, 
is  hardly  of  the  precise  character  required  by  modern  mathematics.  We  can 
supply  a  formal  proof  as  follows.  In  the  first  place,  we  can  find  a  member  of  L 
and  a  member  of  R,  differing  by  as  little  as  we  please.  For  we  saw  in  §  3  that, 
given  any  two  rational  numbers  a  and  b,  we  can  construct  a  chain  of  rational 
numbers,  of  which  a  and  b  are  the  first  and  last,  and  in  which  any  two  consecutive 
numbers  differ  by  as  little  as  we  please.  Let  us  then  take  a  member  x  of  L  and 
a  member  y  of  R ,  and  interpolate  between  them  a  chain  of  rational  numbers  of 
which  x  is  the  first  and  y  the  last,  and  in  which  any  two  consecutive  numbers 
differ  by  less  than  <5,  5  being  any  positive  rational  number  as  small  as  we  please, 
such  as  .01  or  .0001  or  .000  001.  In  this  chain  there  must  be  a  last  which  belongs 
to  L  and  a  first  which  belongs  to  R,  and  these  two  numbers  differ  by  less  than  6. 

We  can  now  prove  that  an  x  can  be  found  in  L  and  a  y  in  R  such  that  2  —  x2 
and  y2  —  2  are  as  small  as  we  please ,  say  less  than  5.  Substituting  \8  for  5  in  the 
argument  which  precedes,  we  see  that  we  can  choose  x  and  y  so  that  y  —  x  <  |<5; 
and  we  may  plainly  suppose  that  both  x  and  y  are  less  than  2.  Thus 

y  +  x  <  4,  y2  -  x2  =  (y  —  x)(y  +  x)  <  4 (y  -  x)  <  5; 

and  since  x2  <  2  and  y2  >  2  it  follows  a  fortiori  that  2  —  x 2  and  y2  —  2  are  each 
less  than  5. 

It  follows  also  that  there  can  be  no  largest  member  of  L  or  smallest 
member  of  R.  For  if  x  is  any  member  of  L,  then  x 2  <  2.  Suppose  that 
x2  =  2  —  8.  Then  we  can  find  a  member  X\  of  L  such  that  x2  differs  from  2 
by  less  than  6,  and  so  xf  >  x2  or  x\  >  x.  Thus  there  are  larger  members 
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of  L  than  x;  and  as  x  is  any  member  of  L,  it  follows  that  no  member 
of  L  can  be  larger  than  all  the  rest.  Hence  L  has  no  largest  member,  and 
similarly  R  has  no  smallest. 

5.  Irrational  numbers  (continued) .  We  have  thus  divided  the  posi¬ 
tive  rational  numbers  into  two  classes,  L  and  R,  such  that  (i)  every  member 
of  R  is  greater  than  every  member  of  L ,  (ii)  we  can  find  a  member  of  L 
and  a  member  of  R  whose  difference  is  as  small  as  we  please,  (iii)  L  has 
no  greatest  and  R  no  least  member.  Our  common-sense  notion  of  the  at¬ 
tributes  of  a  straight  line,  the  requirements  of  our  elementary  geometry  and 
our  elementary  algebra,  alike  demand  the  existence  of  a  number  x  greater 
than  all  the  members  of  L  and  less  than  all  the  members  of  R,  and  of  a  cor¬ 
responding  point  P  on  A  such  that  P  divides  the  points  which  correspond 
to  members  of  L  from  those  which  correspond  to  members  of  R. 


1 

L 

i 

L  LLL 

1  III 

RR  R  R 

_ i i i _ i _ 

R 

_ i _ 

Ao 

1 

1  III 

I  I  I  I  I 

P 

Fig.  3. 

Let  us  suppose  for  a  moment  that  there  is  such  a  number  x,  and  that  it 
may  be  operated  upon  in  accordance  with  the  laws  of  algebra,  so  that,  for 
example,  x2  has  a  definite  meaning.  Then  x2  cannot  be  either  less  than  or 
greater  than  2.  For  suppose,  for  example,  that  x2  is  less  than  2.  Then  it 
follows  from  what  precedes  that  we  can  find  a  positive  rational  number  f 
such  that  f2  lies  between  x2  and  2.  That  is  to  say,  we  can  find  a  member 
of  L  greater  than  x\  and  this  contradicts  the  supposition  that  x  divides  the 
members  of  L  from  those  of  R.  Thus  x2  cannot  be  less  than  2,  and  similarly 
it  cannot  be  greater  than  2.  We  are  therefore  driven  to  the  conclusion  that 
x2  =  2,  and  that  x  is  the  number  which  in  algebra  we  denote  by  \/2.  And 
of  course  this  number  \[2  is  not  rational,  for  no  rational  number  has  its 
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square  equal  to  2.  It  is  the  simplest  example  of  what  is  called  an  irrational 
number. 

But  the  preceding  argument  may  be  applied  to  equations  other  than 
x2  =  2,  almost  word  for  word;  for  example  to  x2  =  N,  where  N  is  any 
integer  which  is  not  a  perfect  square,  or  to 

a:3  =  3,  x3  =  7,  x4  =  23, 

or,  as  we  shall  see  later  on,  to  x3  =  3x  +  8.  We  are  thus  led  to  believe  in  the 
existence  of  irrational  numbers  x  and  points  P  on  A  such  that  x  satisfies 
equations  such  as  these,  even  when  these  lengths  cannot  (as  \/2  can)  be 
constructed  by  means  of  elementary  geometrical  methods. 

The  reader  will  no  doubt  remember  that  in  treatises  on  elementary  algebra 
the  root  of  such  an  equation  as  xq  =  n  is  denoted  by  \/n  or  n1//<?,  and  that  a 
meaning  is  attached  to  such  symbols  as 

np/q ,  n~p/q 

by  means  of  the  equations 

nP/q  =  np/qn~p/q  =  1. 

And  he  will  remember  how,  in  virtue  of  these  definitions,  the  ‘laws  of  indices’ 
such  as 

nr  x  ns  =  nr+s,  ( nr)s  =  nrs 

are  extended  so  as  to  cover  the  case  in  which  r  and  s  are  any  rational  numbers 
whatever. 

The  reader  may  now  follow  one  or  other  of  two  alternative  courses.  He 
may,  if  he  pleases,  be  content  to  assume  that  ‘irrational  numbers’  such 
as  y/2,  v^3,  •  •  •  exist  and  are  amenable  to  the  algebraical  laws  with  which 
he  is  familiar.*  If  he  does  this  he  will  be  able  to  avoid  the  more  abstract 
discussions  of  the  next  few  sections,  and  may  pass  on  at  once  to  §§  13  et  seq. 

If,  on  the  other  hand,  he  is  not  disposed  to  adopt  so  naive  an  attitude, 
he  will  be  well  advised  to  pay  careful  attention  to  the  sections  which  follow, 
in  which  these  questions  receive  fuller  consideration.' 

*This  is  the  point  of  view  which  was  adopted  in  the  first  edition  of  this  book. 

Tn  these  sections  I  have  borrowed  freely  from  Appendix  I  of  Bromwich’s  Infinite 
Series. 
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Examples  III.  1.  Find  the  difference  between  2  and  the  squares  of  the 
decimals  given  in  §  4  as  approximations  to  \/2. 

2.  Find  the  differences  between  2  and  the  squares  of 

1  3  7  17  41  99 

1’  2’  5’  12’  29’  70‘ 

3.  Show  that  if  m/n  is  a  good  approximation  to  y/2,  then  (m  +  2n)  /{m  +  n) 
is  a  better  one,  and  that  the  errors  in  the  two  cases  are  in  opposite  directions. 
Apply  this  result  to  continue  the  series  of  approximations  in  the  last  example. 

4.  If  x  and  y  are  approximations  to  y/2,  by  defect  and  by  excess  respectively, 
and  2  —  x2  <  S,  y2  —  2  <  <5,  then  y  —  x  <  5. 

5.  The  equation  x2  =  4  is  satisfied  by  x  =  2.  Examine  how  far  the  argument 
of  the  preceding  sections  applies  to  this  equation  (writing  4  for  2  throughout). 
[If  we  define  the  classes  L,  R  as  before,  they  do  not  include  all  rational  numbers. 
The  rational  number  2  is  an  exception,  since  22  is  neither  less  than  or  greater 
than  4.] 

6.  Irrational  numbers  ( continued ).  In  §  4  we  discussed  a  special 
mode  of  division  of  the  positive  rational  numbers  x  into  two  classes,  such 
that  x2  <  2  for  the  members  of  one  class  and  x2  >  2  for  those  of  the  others. 
Such  a  mode  of  division  is  called  a  section  of  the  numbers  in  question.  It 
is  plain  that  we  could  equally  well  construct  a  section  in  which  the  numbers 
of  the  two  classes  were  characterised  by  the  inequalities  x3  <  2  and  x3  >  2, 
or  x4  <  7  and  x4  >  7.  Let  us  now  attempt  to  state  the  principles  of  the 
construction  of  such  ‘sections’  of  the  positive  rational  numbers  in  quite 
general  terms. 

Suppose  that  P  and  Q  stand  for  two  properties  which  are  mutually 
exclusive  and  one  of  which  must  be  possessed  by  every  positive  rational 
number.  Further,  suppose  that  every  such  number  which  possesses  P  is  less 
than  any  such  number  which  possesses  Q.  Thus  P  might  be  the  property 
lx2  <  2’  and  Q  the  property  lx2  >  2.’  Then  we  call  the  numbers  which 
possess  P  the  lower  or  left-hand  class  L  and  those  which  possess  Q  the 
upper  or  right-hand  class  R.  In  general  both  classes  will  exist;  but  it  may 
happen  in  special  cases  that  one  is  non-existent  and  that  every  number 
belongs  to  the  other.  This  would  obviously  happen,  for  example,  if  P 
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(or  Q)  were  the  property  of  being  rational,  or  of  being  positive.  For  the 
present,  however,  we  shall  confine  ourselves  to  cases  in  which  both  classes 
do  exist;  and  then  it  follows,  as  in  §  4,  that  we  can  find  a  member  of  L  and 
a  member  of  R  whose  difference  is  as  small  as  we  please. 

In  the  particular  case  which  we  considered  in  §  4,  L  had  no  greatest 
member  and  R  no  least.  This  question  of  the  existence  of  greatest  or  least 
members  of  the  classes  is  of  the  utmost  importance.  We  observe  first  that 
it  is  impossible  in  any  case  that  L  should  have  a  greatest  member  and 
R  a  least.  For  if  l  were  the  greatest  member  of  L ,  and  r  the  least  of  R, 
so  that  l  <  r,  then  ^(/  +  r)  would  be  a  positive  rational  number  lying 
between  l  and  r,  and  so  could  belong  neither  to  L  nor  to  R ;  and  this 
contradicts  our  assumption  that  every  such  number  belongs  to  one  class 
or  to  the  other.  This  being  so,  there  are  but  three  possibilities,  which  are 
mutually  exclusive.  Either  (i)  L  has  a  greatest  member  /,  or  (ii)  R  has  a 
least  member  r,  or  (iii)  L  has  no  greatest  member  and  R  no  least. 

The  section  of  §  4  gives  an  example  of  the  last  possibility.  An  example  of 
the  first  is  obtained  by  taking  P  to  be  ‘ x 2  ^  1’  and  Q  to  be  ‘x2  >1';  here  1  =  1. 
If  P  is  ‘x2  <  I’  and  Q  is  ‘x2  ^  1’,  we  have  an  example  of  the  second  possibility, 
with  r  =  1.  It  should  be  observed  that  we  do  not  obtain  a  section  at  all  by 
taking  P  to  be  ‘x2  <  1’  and  Q  to  be  ‘x2  >1’;  for  the  special  number  1  escapes 
classification  (cf.  Ex.  ill.  5). 

7.  Irrational  numbers  ( continued ).  In  the  first  two  cases  we  say 
that  the  section  corresponds  to  a  positive  rational  number  a,  which  is  l  in 
the  one  case  and  r  in  the  other.  Conversely,  it  is  clear  that  to  any  such 
number  a  corresponds  a  section  which  we  shall  denote  by  a.*  For  we  might 
take  P  and  Q  to  be  the  properties  expressed  by 

x  ^  a,  x  >  a 

respectively,  or  by  x  <  a  and  x  ^  a.  In  the  first  case  a  would  be  the 
greatest  member  of  L ,  and  in  the  second  case  the  least  member  of  R. 

*It  will  be  convenient  to  denote  a  section,  corresponding  to  a  rational  number  de¬ 
noted  by  an  English  letter,  by  the  corresponding  Greek  letter. 
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There  are  in  fact  just  two  sections  corresponding  to  any  positive  rational 
number.  In  order  to  avoid  ambiguity  we  select  one  of  them;  let  us  select 
that  in  which  the  number  itself  belongs  to  the  upper  class.  In  other  words, 
let  us  agree  that  we  will  consider  only  sections  in  which  the  lower  class  L 
has  no  greatest  number. 

There  being  this  correspondence  between  the  positive  rational  numbers 
and  the  sections  defined  by  means  of  them,  it  would  be  perfectly  legitimate, 
for  mathematical  purposes,  to  replace  the  numbers  by  the  sections,  and  to 
regard  the  symbols  which  occur  in  our  formulae  as  standing  for  the  sections 
instead  of  for  the  numbers.  Thus,  for  example,  a  >  a'  would  mean  the 
same  as  a  >  a',  if  a  and  a'  are  the  sections  which  correspond  to  a  and  a'. 

But  when  we  have  in  this  way  substituted  sections  of  rational  numbers 
for  the  rational  numbers  themselves,  we  are  almost  forced  to  a  generali¬ 
sation  of  our  number  system.  For  there  are  sections  (such  as  that  of  §  4) 
which  do  not  correspond  to  any  rational  number.  The  aggregate  of  sec¬ 
tions  is  a  larger  aggregate  than  that  of  the  positive  rational  numbers;  it 
includes  sections  corresponding  to  all  these  numbers,  and  more  besides.  It 
is  this  fact  which  we  make  the  basis  of  our  generalisation  of  the  idea  of 
number.  We  accordingly  frame  the  following  definitions,  which  will  how¬ 
ever  be  modified  in  the  next  section,  and  must  therefore  be  regarded  as 
temporary  and  provisional. 

A  section  of  the  positive  rational  numbers,  in  which  both  classes  exist 
and  the  lower  class  has  no  greatest  member,  is  called  a  positive  real 
number. 

A  positive  real  number  which  does  not  correspond  to  a  positive  rational 
number  is  called  a  positive  irrational  number. 


8.  Real  numbers.  We  have  confined  ourselves  so  far  to  certain  sec¬ 
tions  of  the  positive  rational  numbers,  which  we  have  agreed  provisionally 
to  call  ‘positive  real  numbers.’  Before  we  frame  our  final  definitions,  we 
must  alter  our  point  of  view  a  little.  We  shall  consider  sections,  or  divisions 
into  two  classes,  not  merely  of  the  positive  rational  numbers,  but  of  all  ra¬ 
tional  numbers,  including  zero.  We  may  then  repeat  all  that  we  have  said 
about  sections  of  the  positive  rational  numbers  in  §§  6,  7,  merely  omitting 
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the  word  positive  occasionally. 

Definitions.  A  section  of  the  rational  numbers,  in  which  both  classes 
exist  and  the  lower  class  has  no  greatest  member,  is  called  a  real  number, 
or  simply  a  number. 

A  real  number  which  does  not  correspond  to  a  rational  number  is  called 

an  irrational  number. 

If  the  real  number  does  correspond  to  a  rational  number,  we  shall  use 
the  term  ‘rational’  as  applying  to  the  real  number  also. 

The  term  ‘rational  number’  will,  as  a  result  of  our  definitions,  be  ambiguous; 
it  may  mean  the  rational  number  of  §  1,  or  the  corresponding  real  number.  If  we 
say  that  |  >  1,  we  may  be  asserting  either  of  two  different  propositions,  one  a 
proposition  of  elementary  arithmetic,  the  other  a  proposition  concerning  sections 
of  the  rational  numbers.  Ambiguities  of  this  kind  are  common  in  mathematics, 
and  are  perfectly  harmless,  since  the  relations  between  different  propositions 
are  exactly  the  same  whichever  interpretation  is  attached  to  the  propositions 
themselves.  From  ^  |  and  ^  >  j  we  can  infer  ^  the  inference  is  in  no 

way  affected  by  any  doubt  as  to  whether  and  \  are  arithmetical  fractions 
or  real  numbers.  Sometimes,  of  course,  the  context  in  which  ( e.g .)  ‘  ^ '  occurs  is 

sufficient  to  fix  its  interpretation.  When  we  say  (see  §  9)  that  ^  we  must 

mean  by  the  real  number 

The  reader  should  observe,  moreover,  that  no  particular  logical  importance 
is  to  be  attached  to  the  precise  form  of  definition  of  a  ‘real  number’  that  we  have 
adopted.  We  defined  a  ‘real  number’  as  being  a  section,  i.e.  a  pair  of  classes.  We 
might  equally  well  have  defined  it  as  being  the  lower,  or  the  upper,  class;  indeed 
it  would  be  easy  to  define  an  infinity  of  classes  of  entities  each  of  which  would 
possess  the  properties  of  the  class  of  real  numbers.  What  is  essential  in  math¬ 
ematics  is  that  its  symbols  should  be  capable  of  some  interpretation;  generally 
they  are  capable  of  many ,  and  then,  so  far  as  mathematics  is  concerned,  it  does 
not  matter  which  we  adopt.  Mr  Bertrand  Russell  has  said  that  ‘mathematics 
is  the  science  in  which  we  do  not  know  what  we  are  talking  about,  and  do  not 
care  whether  what  we  say  about  it  is  true’,  a  remark  which  is  expressed  in  the 
form  of  a  paradox  but  which  in  reality  embodies  a  number  of  important  truths. 
It  would  take  too  long  to  analyse  the  meaning  of  Mr  Russell’s  epigram  in  detail, 
but  one  at  any  rate  of  its  implications  is  this,  that  the  symbols  of  mathematics 
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are  capable  of  varying  interpretations,  and  that  we  are  in  general  at  liberty  to 
adopt  whichever  we  prefer. 

There  are  now  three  cases  to  distinguish.  It  may  happen  that  all  neg¬ 
ative  rational  numbers  belong  to  the  lower  class  and  zero  and  all  positive 
rational  numbers  to  the  upper.  We  describe  this  section  as  the  real  num¬ 
ber  zero.  Or  again  it  may  happen  that  the  lower  class  includes  some 
positive  numbers.  Such  a  section  we  describe  as  a  positive  real  number. 
Finally  it  may  happen  that  some  negative  numbers  belong  to  the  upper 
class.  Such  a  section  we  describe  as  a  negative  real  number.* 

The  difference  between  our  present  definition  of  a  positive  real  number  a  and 
that  of  §  7  amounts  to  the  addition  to  the  lower  class  of  zero  and  all  the  negative 
rational  numbers.  An  example  of  a  negative  real  number  is  given  by  taking  the 
property  P  of  §  6  to  be  x  +  1  <  0  and  Q  to  be  x  +  1  ^  0.  This  section  plainly 
corresponds  to  the  negative  rational  number  —1.  If  we  took  P  to  be  r3  <  —2 
and  Q  to  be  a:3  >  —2,  we  should  obtain  a  negative  real  number  which  is  not 
rational. 


9.  Relations  of  magnitude  between  real  numbers.  It  is  plain 

that,  now  that  we  have  extended  our  conception  of  number,  we  are  bound 
to  make  corresponding  extensions  of  our  conceptions  of  equality,  inequality, 
addition,  multiplication,  and  so  on.  We  have  to  show  that  these  ideas  can 
be  applied  to  the  new  numbers,  and  that,  when  this  extension  of  them 
is  made,  all  the  ordinary  laws  of  algebra  retain  their  validity,  so  that  we 
can  operate  with  real  numbers  in  general  in  exactly  the  same  way  as  with 
the  rational  numbers  of  §  1.  To  do  all  this  systematically  would  occupy  a 

*  There  are  also  sections  in  which  every  number  belongs  to  the  lower  or  to  the  upper 
class.  The  reader  may  be  tempted  to  ask  why  we  do  not  regard  these  sections  also  as 
defining  numbers,  which  we  might  call  the  real  numbers  positive  and  negative  infinity. 

There  is  no  logical  objection  to  such  a  procedure,  but  it  proves  to  be  inconvenient 
in  practice.  The  most  natural  definitions  of  addition  and  multiplication  do  not  work 
in  a  satisfactory  way.  Moreover,  for  a  beginner,  the  chief  difficulty  in  the  elements 
of  analysis  is  that  of  learning  to  attach  precise  senses  to  phrases  containing  the  word 
‘infinity’;  and  experience  seems  to  show  that  he  is  likely  to  be  confused  by  any  addition 
to  their  number. 
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considerable  space,  and  we  shall  be  content  to  indicate  summarily  how  a 
more  systematic  discussion  would  proceed. 

We  denote  a  real  number  by  a  Greek  letter  such  as  ct,  /3,  7, ... ;  the 
rational  numbers  of  its  lower  and  upper  classes  by  the  corresponding  En¬ 
glish  letters  a,  A ;  6,  £>;  c,  C\  ....  The  classes  themselves  we  denote  by 
(a),  (A),.... 

If  a  and  ft  are  two  real  numbers,  there  are  three  possibilities: 

(i)  every  a  is  a  b  and  every  A  a  B;  in  this  case  (a)  is  identical  with  (fr) 
and  (A)  with  (B); 

(ii)  every  a  is  a  6,  but  not  all  Tl’s  are  B'  s;  in  this  case  (a)  is  a  proper 
part  of  ( b ),*  and  ( B )  a  proper  part  of  (A); 

(iii)  every  A  is  a  B,  but  not  all  a’s  are  6’s. 

These  three  cases  may  be  indicated  graphically  as  in  Fig.  4. 

In  case  (i)  we  write  a  =  (3,  in  case  (ii)  a  <  (3,  and  in  case  (iii)  a  >  (3. 
It  is  clear  that,  when  a  and  /3  are  both  rational,  these  definitions  agree 
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Fig.  4. 

with  the  ideas  of  equality  and  inequality  between  rational  numbers  which 
we  began  by  taking  for  granted;  and  that  any  positive  number  is  greater 
than  any  negative  number. 

It  will  be  convenient  to  define  at  this  stage  the  negative  —a  of  a  positive 
number  a.  If  (a),  (A)  are  the  classes  which  constitute  a,  we  can  define 
another  section  of  the  rational  numbers  by  putting  all  numbers  —A  in  the 
lower  class  and  all  numbers  —a  in  the  upper.  The  real  number  thus  defined, 
which  is  clearly  negative,  we  denote  by  —a.  Similarly  we  can  define  —a 

*I.e.  is  included  in  but  not  identical  with  (6). 
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when  a  is  negative  or  zero;  if  a  is  negative,  —a  is  positive.  It  is  plain  also 
that  —(—a)  =  a.  Of  the  two  numbers  a  and  —a  one  is  always  positive 
(unless  a  =  0).  The  one  which  is  positive  we  denote  by  |a|  and  call  the 
modulus  of  a. 

Examples  IV.  1.  Prove  that  0  =  —0. 

2.  Prove  that  /3  =  a,  /3  <  a,  or  /3  >  a  according  as  a  =  /3,  a  >  /3,  or  a  <  f3. 

3.  If  a  =  (3  and  /3  =  7,  then  0  =  7. 

4.  If  a  A  (3,  (3  <  7,  or  a  <  f5,  f3  A  7,  then  a  <  7. 

5.  Prove  that  —f3  =  —a,  —/3  <  —a,  or  —  (3  >  —a,  according  as  a  =  f3, 

a  <  /3,  or  a  >  j3. 

6.  Prove  that  a  >  0  if  a  is  positive,  and  a  <  0  if  a  is  negative. 

7.  Prove  that  a  ^  |a|. 

8.  Prove  that  1  <  \/2  <  \/3  <  2. 

9.  Prove  that,  if  a  and  (3  are  two  different  real  numbers,  we  can  always  find 
an  infinity  of  rational  numbers  lying  between  a  and  f3. 

[All  these  results  are  immediate  consequences  of  our  definitions.] 

10.  Algebraical  operations  with  real  numbers.  We  now  proceed 
to  define  the  meaning  of  the  elementary  algebraical  operations  such  as 
addition,  as  applied  to  real  numbers  in  general. 

(i)  Addition.  In  order  to  define  the  sum  of  two  numbers  a  and  (3, 
we  consider  the  following  two  classes:  (i)  the  class  (c)  formed  by  all  sums 
c  =  a  +  b,  (ii)  the  class  (C)  formed  by  all  sums  C  =  A  +  B.  Plainly  c  <  C 
in  all  cases. 

Again,  there  cannot  be  more  than  one  rational  number  which  does  not 
belong  either  to  (c)  or  to  (C).  For  suppose  there  were  two,  say  r  and  s, 
and  let  s  be  the  greater.  Then  both  r  and  s  must  be  greater  than  every  c 
and  less  than  every  C;  and  soG-c  cannot  be  less  than  s  —  r.  But 

C  —  c  =  (A  —  a)  +  (B  —  6); 

and  we  can  choose  a,  b ,  A,  B  so  that  both  A  —  a  and  B  —  6  are  as  small  as 
we  like;  and  this  plainly  contradicts  our  hypothesis. 
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If  every  rational  number  belongs  to  (c)  or  to  (C),  the  classes  (c),  (C) 
form  a  section  of  the  rational  numbers,  that  is  to  say,  a  number  7.  If  there 
is  one  which  does  not,  we  add  it  to  (C).  We  have  now  a  section  or  real 
number  7,  which  must  clearly  be  rational,  since  it  corresponds  to  the  least 
member  of  (C).  In  any  case  we  call  7  the  sum  of  a  and  /3,  and  write 

7  =  a  +  (3. 

If  both  a  and  /3  are  rational,  they  are  the  least  members  of  the  upper  classes 
(A)  and  ( B ).  In  this  case  it  is  clear  that  o  +  (3  is  the  least  member  of  (C),  so 
that  our  definition  agrees  with  our  previous  ideas  of  addition. 

(ii)  Subtraction.  We  define  a  —  f3  by  the  equation 

a  —  (3  =  a  +  (—/?). 

The  idea  of  subtraction  accordingly  presents  no  fresh  difficulties. 

Examples  V.  1.  Prove  that  a  +  (— a)  =  0. 

2.  Prove  that  a  +  0  =  0  +  a  =  a. 

3.  Prove  that  a  +  [3  =  (3  +  o.  [This  follows  at  once  from  the  fact  that  the 
classes  (a  +  b )  and  ( b  +  a),  or  (A  +  B)  and  ( B  +  A ),  are  the  same,  since,  e.g., 
a  +  b  =  b  +  a  when  a  and  b  are  rational.] 

4.  Prove  that  a  +  (/3  +  7)  =  (a  +  f3)  +  7- 

5.  Prove  that  a  —  a  =  0. 

6.  Prove  that  a  —  (3  =  —(/?  —  a). 

7.  From  the  definition  of  subtraction,  and  Exs.  4,  1,  and  2  above,  it  follows 
that 

(o  —  f3)  +  /?  =  {ct  +  (— /?)}  +  f3  =  ol  +  {(— /3)  +  /3}  =  q:  +  0  =  q:. 

We  might  therefore  define  the  difference  o  —  f5  =  7  by  the  equation  7  +  /3  =  <x 

8.  Prove  that  o  —  (/?  —  7)  =  o  —  /3  +  7. 

9.  Give  a  definition  of  subtraction  which  does  not  depend  upon  a  previous 
definition  of  addition.  [To  define  7  =  o  —  /3,  form  the  classes  (c),  ( C )  for  which 
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c  =  a  —  B,  C  =  A  —  b.  It  is  easy  to  show  that  this  definition  is  equivalent  to 
that  which  we  adopted  in  the  text.] 

10.  Prove  that 

||a|  —  |/?||  A  \a±  /3\  |a|  +  |/3|. 


11.  Algebraical  operations  with  real  numbers  ( continued ). 
(iii)  Multiplication.  When  we  come  to  multiplication,  it  is  most  con¬ 
venient  to  confine  ourselves  to  positive  numbers  (among  which  we  may 
include  0)  in  the  first  instance,  and  to  go  back  for  a  moment  to  the  sections 
of  positive  rational  numbers  only  which  we  considered  in  §§  4-7.  We  may 
then  follow  practically  the  same  road  as  in  the  case  of  addition,  taking  (c) 
to  be  ( ab )  and  (C)  to  be  ( AB ).  The  argument  is  the  same,  except  when 
we  are  proving  that  all  rational  numbers  with  at  most  one  exception  must 
belong  to  (c)  or  (C).  This  depends,  as  in  the  case  of  addition,  on  showing 
that  we  can  choose  a,  A,  b ,  and  B  so  that  C  —  c  is  as  small  as  we  please. 
Here  we  use  the  identity 

C  —  c  =  AB  —  ab  =  (A  —  a)B  +  a{B  —  b). 

Finally  we  include  negative  numbers  within  the  scope  of  our  definition 
by  agreeing  that,  if  a  and  f3  are  positive,  then 

(— a)(3  =  — a/3 ,  a{—(3)  =  —af3,  (—«)(—/?)  =  a/3. 

(iv)  Division.  In  order  to  define  division,  we  begin  by  defining  the 
reciprocal  1/a  of  a  number  a  (other  than  zero).  Confining  ourselves  in  the 
first  instance  to  positive  numbers  and  sections  of  positive  rational  numbers, 
we  define  the  reciprocal  of  a  positive  number  a  by  means  of  the  lower 
class  (1/A)  and  the  upper  class  (1/a).  We  then  define  the  reciprocal  of  a 
negative  number  —a  by  the  equation  l/(— a)  =  —{1/a).  Finally  we  define 
a/ /3  by  the  equation 


a/(3  =  a  x  (1  / f3). 
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We  are  then  in  a  position  to  apply  to  all  real  numbers,  rational  or 
irrational,  the  whole  of  the  ideas  and  methods  of  elementary  algebra.  Nat¬ 
urally  we  do  not  propose  to  carry  out  this  task  in  detail.  It  will  be  more 
profitable  and  more  interesting  to  turn  our  attention  to  some  special,  but 
particularly  important,  classes  of  irrational  numbers. 


Examples  VI.  Prove  the  theorems  expressed  by  the  following  formulae: 

1.  a  x  0  =  0  X  a  =  0.  5.  a(/3y)  =  (a/3)y. 

2.  a  x  1  =  1  x  a  =  a.  6.  a(/3  +  7)  =  a/3  +  ay. 

3.  ax(l/a)  =  l.  7.  (a  +  /3)y  =  ay  /3y. 

4.  a(3  =  /3a.  8.  \a/3\  =  |a|  \/3\. 


12.  The  number  y/2.  Let  us  now  return  for  a  moment  to  the  partic¬ 
ular  irrational  number  which  we  discussed  in  §§  4-5.  We  there  constructed 
a  section  by  means  of  the  inequalities  x 2  <  2,  x2  >  2.  This  was  a  section 
of  the  positive  rational  numbers  only;  but  we  replace  it  (as  was  explained 
in  §  8)  by  a  section  of  all  the  rational  numbers.  We  denote  the  section  or 
number  thus  defined  by  the  symbol  a/2. 

The  classes  by  means  of  which  the  product  of  a/2  by  itself  is  defined 
are  (i)  (aa'),  where  a  and  a!  are  positive  rational  numbers  whose  squares 
are  less  than  2,  (ii)  {AA1),  where  A  and  A '  are  positive  rational  numbers 
whose  squares  are  greater  than  2.  These  classes  exhaust  all  positive  rational 
numbers  save  one,  which  can  only  be  2  itself.  Thus 

(\/2)2  =  V2V2  =  2. 

Again 

(-V2)2  =  (-V2)(-V2)  =  =  (V2)2  =  2. 

Thus  the  equation  x2  =  2  has  the  two  roots  a/2  and  —a/2.  Similarly  we 
could  discuss  the  equations  x2  =  3,  x3  =  7, . . .  and  the  corresponding 
irrational  numbers  y/3,  —a/3,  a/7,  .... 
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13.  Quadratic  surds.  A  number  of  the  form  ±y/a,  where  a  is  a 
positive  rational  number  which  is  not  the  square  of  another  rational  num¬ 
ber,  is  called  a  pure  quadratic  surd.  A  number  of  the  form  a  ±  y/b,  where 
a  is  rational,  and  y/b  is  a  pure  quadratic  surd,  is  sometimes  called  a  mixed 
quadratic  surd. 

The  two  numbers  a  ±  y/b  are  the  roots  of  the  quadratic  equation 

x 2  —  2  ax  +  a2  —  b  =  0. 

Conversely,  the  equation  x2  +  2 px  +  q  =  0,  where  p  and  q  are  rational,  and 
p2  —  q  >  0,  has  as  its  roots  the  two  quadratic  surds  —  p  ±  \Jp2  —  q. 

The  only  kind  of  irrational  numbers  whose  existence  was  suggested  by 
the  geometrical  considerations  of  §  3  are  these  quadratic  surds,  pure  and 
mixed,  and  the  more  complicated  irrationals  which  may  be  expressed  in  a 
form  involving  the  repeated  extraction  of  square  roots,  such  as 


ft  is  easy  to  construct  geometrically  a  line  whose  length  is  equal  to 
any  number  of  this  form,  as  the  reader  will  easily  see  for  himself.  That 
irrational  numbers  of  these  kinds  only  can  be  constructed  by  Euclidean 
methods  (i.e.  by  geometrical  constructions  with  ruler  and  compasses)  is  a 
point  the  proof  of  which  must  be  deferred  for  the  present.*  This  property 
of  quadratic  surds  makes  them  especially  interesting. 

Examples  VII.  1.  Give  geometrical  constructions  for 

V2,  \j  2  +  y/2,  \J  2  +  \J  2  +  ^2. 

2.  The  quadratic  equation  ax2+2bx+c  =  0  has  two  real  roots^  if  b2—ac  >  0. 

*See  Ch.  II,  Misc.  Exs.  22. 

t I.e.  there  are  two  values  of  x  for  which  ax2  +  2 bx  +  c  =  0.  If  b2  —  ac  <  0  there 
are  no  such  values  of  x.  The  reader  will  remember  that  in  books  on  elementary  algebra 
the  equation  is  said  to  have  two  ‘complex’  roots.  The  meaning  to  be  attached  to  this 
statement  will  be  explained  in  Ch.  III. 

When  b 2  =  ac  the  equation  has  only  one  root.  For  the  sake  of  uniformity  it  is  generally 
said  in  this  case  to  have  ‘two  equal’  roots,  but  this  is  a  mere  convention. 
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Suppose  a,  6,  c  rational.  Nothing  is  lost  by  taking  all  three  to  be  integers,  for  we 
can  multiply  the  equation  by  the  least  common  multiple  of  their  denominators. 

The  reader  will  remember  that  the  roots  are  {— b  ±  s/b2  —  ac}/a.  It  is  easy 
to  construct  these  lengths  geometrically,  first  constructing  \Jb2  —  ac.  A  much 
more  elegant,  though  less  straightforward,  construction  is  the  following. 

Draw  a  circle  of  unit  radius,  a  diameter  PQ,  and  the  tangents  at  the  ends 
of  the  diameters. 


P 


Fig.  5. 

Take  PP'  =  —2 a/b  and  QQ'  =  —c/26,  having  regard  to  sign.*  Join  P'Q' , 
cutting  the  circle  in  M  and  N .  Draw  PM  and  PN,  cutting  QQ'  in  X  and  Y . 
Then  QX  and  QY  are  the  roots  of  the  equation  with  their  proper  signs f 

The  proof  is  simple  and  we  leave  it  as  an  exercise  to  the  reader.  Another, 
perhaps  even  simpler,  construction  is  the  following.  Take  a  line  AB  of  unit 
length.  Draw  BC  =  —2b/ a  perpendicular  to  AB,  and  CD  =  c/a  perpendicular 
to  BC  and  in  the  same  direction  as  BA.  On  AD  as  diameter  describe  a  circle 
cutting  BC  in  X  and  Y .  Then  BX  and  BY  are  the  roots. 

3.  If  ac  is  positive  PP'  and  QQ'  will  be  drawn  in  the  same  direction.  Verify 
that  P'Q'  will  not  meet  the  circle  if  6 2  <  ac,  while  if  62  =  ac  it  will  be  a  tangent. 
Verify  also  that  if  b 2  =  ac  the  circle  in  the  second  construction  will  touch  BC. 

4.  Prove  that 

Vpq  =  Vpx  =  pVq- 


*The  figure  is  drawn  to  suit  the  case  in  which  b  and  c  have  the  same  and  a  the 
opposite  sign.  The  reader  should  draw  figures  for  other  cases. 

'  I  have  taken  this  construction  from  Klein’s  Legons  sur  certaines  questions  de 
geometrie  elementaire  (French  translation  by  J.  Griess,  Paris,  1896). 
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14.  Some  theorems  concerning  quadratic  surds.  Two  pure 
quadratic  surds  are  said  to  be  similar  if  they  can  be  expressed  as  rational 
multiples  of  the  same  surd,  and  otherwise  to  be  dissimilar.  Thus 


=  2y/2,  sjf  =  |V2, 


and  so  \/8,  are  similar  surds.  On  the  other  hand,  if  M  and  N  are 
integers  which  have  no  common  factor,  and  neither  of  which  is  a  perfect 
square,  \[M  and  \[N  are  dissimilar  surds.  For  suppose,  if  possible, 


where  all  the  letters  denote  integers. 

Then  a JMN  is  evidently  rational,  and  therefore  (Ex.  II.  3)  integral. 
Thus  MN  =  P2,  where  P  is  an  integer.  Let  a,  b ,  c, . . .  be  the  prime 
factors  of  P,  so  that 

MN  =  a2ab2f)c21  ..., 

where  a,  (3,  7, . . .  are  positive  integers.  Then  MN  is  divisible  by  a2a,  and 
therefore  either  (1)  M  is  divisible  by  a2a,  or  (2)  N  is  divisible  by  a2“,  or 
(3)  M  and  N  are  both  divisible  by  a.  The  last  case  may  be  ruled  out, 
since  M  and  N  have  no  common  factor.  This  argument  may  be  applied  to 
each  of  the  factors  a2a,  b2/3 ,  c27, . . . ,  so  that  M  must  be  divisible  by  some 
of  these  factors  and  N  by  the  remainder.  Thus 


M  =  P2,  N  =  P2, 

where  P2  denotes  the  product  of  some  of  the  factors  a2",  b213,  c2"7, . . .  and 
Pf  the  product  of  the  rest.  Hence  M  and  N  are  both  perfect  squares, 
which  is  contrary  to  our  hypothesis. 

Theorem.  If  A,  B,  C,  D  are  rational  and 

a  +  Vb  =  c  +  Vd, 


then  either  (i)  A  =  C ,  B  =  D  or  (ii)  B  and  D  are  both  squares  of  rational 
numbers. 
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For  B  —  D  is  rational,  and  so  is 

VB-VD  =  C  -A. 

If  B  is  not  equal  to  D  (in  which  case  it  is  obvious  that  A  is  also  equal 
to  C ),  it  follows  that 

VB  +  VD  =  (B  -  D)/(y/B  -  VD) 

is  also  rational.  Hence  y/~B  and  \fTf)  are  rational. 

Corollary.  If  A  +  yJ~B  =  C  +  y/~D ,  then  A  —  yJ~B  =  C  —  yfD  ( unless 
y/~B  and  yfD  are  both  rational). 

Examples  VIII.  1.  Prove  ab  initio  that  y/2  and  y/3  are  not  similar 
surds. 

2.  Prove  that  yfa  and  /a,  where  a  is  rational,  are  similar  surds  (unless 
both  are  rational). 

3.  If  a  and  b  are  rational,  then  \fa+\/b  cannot  be  rational  unless  yfa  and  \fb 
are  rational.  The  same  is  true  of  \fa  —  \/b,  unless  a  =  b. 

4.  K  _ 

\/A  +  \lB=  y/C  +  VD, 

then  either  (a)  A  =  C  and  B  =  D,  or  (6)  A  =  D  and  B  =  C,  or  (c)  y/A,  y/B, 
yJC,  VD  are  all  rational  or  all  similar  surds.  [Square  the  given  equation  and 
apply  the  theorem  above.] 

5.  Neither  (a  +  \/b)3  nor  (a  —  V^)3  can  be  rational  unless  yfb  is  rational. 

6.  Prove  that  if  x  =  p  +  y/q,  where  p  and  q  are  rational,  then  xm,  where 
m  is  any  integer,  can  be  expressed  in  the  form  P  +  Qy/q,  where  P  and  Q  are 
rational.  For  example, 

(P  +  y/q?  =  P2  +  Q  +  2  Pyfq,  (p  +  y/qf  =  p3  +  3  pq  +  (3  p2  +  q)  yfq. 

Deduce  that  any  polynomial  in  x  with  rational  coefficients  (i.e.  any  expression 
of  the  form 

a0xn  +  aixn_1  H - h  an, 

where  ao,  . . . ,  an  are  rational  numbers)  can  be  expressed  in  the  form  P  +  Qy/q. 
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7.  If  a  +  Vb,  where  b  is  not  a  perfect  square,  is  the  root  of  an  algebraical 
equation  with  rational  coefficients,  then  a  —  Vb  is  another  root  of  the  same 
equation. 

8.  Express  l/(p  +  y/q)  in  the  form  prescribed  in  Ex.  6.  [Multiply  numerator 
and  denominator  by  p  —  y/q.] 

9.  Deduce  from  Exs.  6  and  8  that  any  expression  of  the  form  G(x)/H(x), 
where  G{x)  and  H(x)  are  polynomials  in  x  with  rational  coefficients,  can  be 
expressed  in  the  form  P  +  Qy/q,  where  P  and  Q  are  rational. 

10.  If  p,  q,  and  p2  —  q  are  positive,  we  can  express  yjp  +  y/q  in  the  form 
yfx  +  /y,  where 

x  =  \{p+  \Jp 2  -  q},  y  =  \{p  -  Vv2  -  <?}■ 

11.  Determine  the  conditions  that  it  may  be  possible  to  express  yjp  +  y/q, 
where  p  and  q  are  rational,  in  the  form  /x  +  y/y,  where  x  and  y  are  rational. 

12.  If  a2  —  b  is  positive,  the  necessary  and  sufficient  conditions  that 

\j  a  +  Vb  +  \j  a  -  Vb 

should  be  rational  are  that  a2  —  b  and  ^{a  +  V a2  —  b}  should  both  be  squares 
of  rational  numbers. 

15.  The  continuum.  The  aggregate  of  all  real  numbers,  rational 
and  irrational,  is  called  the  arithmetical  continuum. 

It  is  convenient  to  suppose  that  the  straight  line  A  of  §  2  is  composed  of 
points  corresponding  to  all  the  numbers  of  the  arithmetical  continuum,  and 
of  no  others.*  The  points  of  the  line,  the  aggregate  of  which  may  be  said 
to  constitute  the  linear  continuum,  then  supply  us  with  a  convenient 
image  of  the  arithmetical  continuum. 

We  have  considered  in  some  detail  the  chief  properties  of  a  few  classes 
of  real  numbers,  such,  for  example,  as  rational  numbers  or  quadratic  surds. 

*This  supposition  is  merely  a  hypothesis  adopted  (i)  because  it  suffices  for  the 
purposes  of  our  geometry  and  (ii)  because  it  provides  us  with  convenient  geometrical 
illustrations  of  analytical  processes.  As  we  use  geometrical  language  only  for  purposes 
of  illustration,  it  is  not  part  of  our  business  to  study  the  foundations  of  geometry. 


[1:15] 


REAL  VARIABLES 


27 


We  add  a  few  further  examples  to  show  how  very  special  these  particular 
classes  of  numbers  are,  and  how,  to  put  it  roughly,  they  comprise  only 
a  minute  fraction  of  the  infinite  variety  of  numbers  which  constitute  the 
continuum. 

(i)  Let  us  consider  a  more  complicated  surd  expression  such  as 

z  =  \j 4  +  V15  +  \/ A-  \/T5. 

Our  argument  for  supposing  that  the  expression  for  z  has  a  meaning  might  be 
as  follows.  We  first  show,  as  in  §  12,  that  there  is  a  number  y  =  \/l5  such  that 
y 2  =  15,  and  we  can  then,  as  in  §  10,  define  the  numbers  4  +  V15,  4  —  \/l5.  Now 
consider  the  equation  in  z\, 

z\  =  A  +  v/15. 

The  right-hand  side  of  this  equation  is  not  rational:  but  exactly  the  same  rea¬ 
soning  which  leads  us  to  suppose  that  there  is  a  real  number  x  such  that  x3  =  2 
(or  any  other  rational  number)  also  leads  us  to  the  conclusion  that  there  is  a 
number  z\  such  that  z3  =  4+ \/l5.  We  thus  define  z\  =  \J  A  +  \/l5,  and  similarly 
we  can  define  22  =  \/ A  —  \/t5;  and  then,  as  in  §  10,  we  define  z  =  z\  +  22. 

Now  it  is  easy  to  verify  that 


23  =  32  +  8. 

And  we  might  have  given  a  direct  proof  of  the  existence  of  a  unique  number  z 
such  that  z3  =  32  +  8.  It  is  easy  to  see  that  there  cannot  be  two  such  numbers. 
For  if  z3  =  32i  +  8  and  z\  =  322  +  8,  we  find  on  subtracting  and  dividing  by 
z\  —  Z2  that  z\  +  z\Z2  +  z\  =  3.  But  if  z\  and  22  are  positive  z3  >  8,  z\  >  8  and 
therefore  z\  >  2,  22  >  2,  z\  +  Z\Z2  +  z\  >  12,  and  so  the  equation  just  found  is 
impossible.  And  it  is  easy  to  see  that  neither  21  nor  22  can  be  negative.  For  if 
z\  is  negative  and  equal  to  —  £,  (  is  positive  and  C3  —  3£  +  8  =  0,  or  3  —  £2  =  8/C- 
Hence  3  —  £2  >  0,  and  so  £  <  2.  But  then  8/C  >  4,  and  so  8/C  cannot  be  equal 
to  3  —  C2  5  which  is  less  than  3. 

Hence  there  is  at  most  one  2  such  that  z3  =  3z  +  8.  And  it  cannot  be 
rational.  For  any  rational  root  of  this  equation  must  be  integral  and  a  factor 
of  8  (Ex.  II.  3),  and  it  is  easy  to  verify  that  no  one  of  1,  2,  4,  8  is  a  root. 

Thus  23  =  32  +  8  has  at  most  one  root  and  that  root,  if  it  exists,  is  positive 
and  not  rational.  We  can  now  divide  the  positive  rational  numbers  x  into  two 
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classes  L,  R  according  as  x3  <  3x  +  8  or  x3  >  3x  +  8.  It  is  easy  to  see  that  if 
x 3  >  3x  +  8  and  y  is  any  number  greater  than  x,  then  also  y 3  >  2>y  +  8.  For 
suppose  if  possible  y3  ^  3y  +  8.  Then  since  x3  >  3x  +  8  we  obtain  on  subtracting 
y3  —  x3  <  3 (y  —  x),  or  y2  +  xy  +  x2  <  3,  which  is  impossible;  for  y  is  positive 
and  x  >  2  (since  x3  >  8) .  Similarly  we  can  show  that  if  x3  <  3x  +  8  and  y  <  x 
then  also  y3  <  3y  +  8. 

Finally,  it  is  evident  that  the  classes  L  and  R  both  exist;  and  they  form  a 
section  of  the  positive  rational  numbers  or  positive  real  number  z  which  satisfies 
the  equation  z3  =  3z  +  8.  The  reader  who  knows  how  to  solve  cubic  equations 
by  Cardan’s  method  will  be  able  to  obtain  the  explicit  expression  of  z  directly 
from  the  equation. 

(ii)  The  direct  argument  applied  above  to  the  equation  x3  =  3x  +  8 
could  be  applied  (though  the  application  would  be  a  little  more  difficult) 
to  the  equation 

x5  =  x  +  16, 

and  would  lead  us  to  the  conclusion  that  a  unique  positive  real  number 
exists  which  satisfies  this  equation.  In  this  case,  however,  it  is  not  possible 
to  obtain  a  simple  explicit  expression  for  x  composed  of  any  combination 
of  surds.  It  can  in  fact  be  proved  (though  the  proof  is  difficult)  that  it  is 
generally  impossible  to  find  such  an  expression  for  the  root  of  an  equation 
of  higher  degree  than  4.  Thus,  besides  irrational  numbers  which  can  be 
expressed  as  pure  or  mixed  quadratic  or  other  surds,  or  combinations  of 
such  surds,  there  are  others  which  are  roots  of  algebraical  equations  but 
cannot  be  so  expressed.  It  is  only  in  very  special  cases  that  such  expressions 
can  be  found. 

(iii)  But  even  when  we  have  added  to  our  list  of  irrational  numbers 
roots  of  equations  (such  as  x5  =  x+16)  which  cannot  be  explicitly  expressed 
as  surds,  we  have  not  exhausted  the  different  kinds  of  irrational  numbers 
contained  in  the  continuum.  Let  us  draw  a  circle  whose  diameter  is  equal 
to  A0A1,  i.e.  to  unity.  It  is  natural  to  suppose*  that  the  circumference  of 
such  a  circle  has  a  length  capable  of  numerical  measurement.  This  length 


A  proof  will  be  found  in  Ch.  VII. 
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is  usually  denoted  by  tt.  And  it  has  been  shown*  (though  the  proof  is 
unfortunately  long  and  difficult)  that  this  number  n  is  not  the  root  of  any 
algebraical  equation  with  integral  coefficients,  such,  for  example,  as 

7r2  =  n,  7 r3  —  n,  n5  =  tx  +  n, 

where  n  is  an  integer.  In  this  way  it  is  possible  to  define  a  number  which 
is  not  rational  nor  yet  belongs  to  any  of  the  classes  of  irrational  numbers 
which  we  have  so  far  considered.  And  this  number  n  is  no  isolated  or  ex¬ 
ceptional  case.  Any  number  of  other  examples  can  be  constructed.  In  fact 
it  is  only  special  classes  of  irrational  numbers  which  are  roots  of  equations 
of  this  kind,  just  as  it  is  only  a  still  smaller  class  which  can  be  expressed 
by  means  of  surds. 

16.  The  continuous  real  variable.  The  ‘real  numbers’  may  be  re¬ 
garded  from  two  points  of  view.  We  may  think  of  them  as  an  aggregate , 
the  ‘arithmetical  continuum’  defined  in  the  preceding  section,  or  individ¬ 
ually.  And  when  we  think  of  them  individually,  we  may  think  either  of  a 
particular  specified  number  (such  as  1,  —  y/2,  or  7r)  or  we  may  think  of 

any  number,  an  unspecified  number,  the  number  x.  This  last  is  our  point 
of  view  when  we  make  such  assertions  as  lx  is  a  number’,  ‘x  is  the  mea¬ 
sure  of  a  length’,  ‘rr  may  be  rational  or  irrational’.  The  x  which  occurs  in 
propositions  such  as  these  is  called  the  continuous  real  variable:  and  the 
individual  numbers  are  called  the  values  of  the  variable. 

A  ‘variable’,  however,  need  not  necessarily  be  continuous.  Instead  of 
considering  the  aggregate  of  all  real  numbers,  we  might  consider  some 
partial  aggregate  contained  in  the  former  aggregate,  such  as  the  aggregate 
of  rational  numbers,  or  the  aggregate  of  positive  integers.  Let  us  take  the 
last  case.  Then  in  statements  about  any  positive  integer,  or  an  unspecified 
positive  integer,  such  as  ln  is  either  odd  or  even’,  n  is  called  the  variable,  a 
positive  integral  variable ,  and  the  individual  positive  integers  are  its  values. 

Naturally  ‘x’  and  ‘n’  are  only  examples  of  variables,  the  variable  whose 
‘field  of  variation’  is  formed  by  all  the  real  numbers,  and  that  whose  field  is 

*See  Hobson’s  Trigonometry  (3rd  edition),  pp.  305  et  seq.,  or  the  same  writer’s 
Squaring  the  Circle  (Cambridge,  1913). 
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formed  by  the  positive  integers.  These  are  the  most  important  examples, 
but  we  have  often  to  consider  other  cases.  In  the  theory  of  decimals,  for 
instance,  we  may  denote  by  x  any  figure  in  the  expression  of  any  number 
as  a  decimal.  Then  x  is  a  variable,  but  a  variable  which  has  only  ten 
different  values,  viz.  0,  1,  2,  3,  4,  5,  6,  7,  8,  9.  The  reader  should  think  of 
other  examples  of  variables  with  different  fields  of  variation.  He  will  find 
interesting  examples  in  ordinary  life:  policeman  x,  the  driver  of  cab  x,  the 
year  x,  the  ofh  day  of  the  week.  The  values  of  these  variables  are  naturally 
not  numbers. 

17.  Sections  of  the  real  numbers.  In  §§  4-7  we  considered  ‘sec¬ 
tions’  of  the  rational  numbers,  i.e.  modes  of  division  of  the  rational  num¬ 
bers  (or  of  the  positive  rational  numbers  only)  into  two  classes  L  and  R 
possessing  the  following  characteristic  properties: 

(i)  that  every  number  of  the  type  considered  belongs  to  one  and  only 
one  of  the  two  classes; 

(ii)  that  both  classes  exist; 

(iii)  that  any  member  of  L  is  less  than  any  member  of  R. 

It  is  plainly  possible  to  apply  the  same  idea  to  the  aggregate  of  all  real 
numbers,  and  the  process  is,  as  the  reader  will  find  in  later  chapters,  of 
very  great  importance. 

Let  us  then  suppose*  that  P  and  Q  are  two  properties  which  are  mutu¬ 
ally  exclusive,  and  one  of  which  is  possessed  by  every  real  number.  Further 
let  us  suppose  that  any  number  which  possesses  P  is  less  than  any  which 
possesses  Q.  We  call  the  numbers  which  possess  P  the  lower  or  left-hand 
class  L,  and  those  which  possess  Q  the  upper  or  right-hand  class  R. 

Thus  P  might  be  x  ^  \/2  and  Q  be  x  >  \/2.  It  is  important  to  observe 
that  a  pair  of  properties  which  suffice  to  define  a  section  of  the  rational  numbers 

*The  discussion  which  follows  is  in  many  ways  similar  to  that  of  §  6.  We  have  not 
attempted  to  avoid  a  certain  amount  of  repetition.  The  idea  of  a  ‘section,’  first  brought 
into  prominence  in  Dedekind’s  famous  pamphlet  Stetigkeit  und  irrationale  Zahlen ,  is 
one  which  can,  and  indeed  must,  be  grasped  by  every  reader  of  this  book,  even  if  he 
be  one  of  those  who  prefer  to  omit  the  discussion  of  the  notion  of  an  irrational  number 
contained  in  §§  6-12. 
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may  not  suffice  to  define  one  of  the  real  numbers.  This  is  so,  for  example,  with 
the  pair  'x  <  \/T  and  ‘x  >  \/2’  or  (if  we  confine  ourselves  to  positive  numbers) 
with  ‘x2  <  2’  and  ‘x2  >  2’.  Every  rational  number  possesses  one  or  other 
of  the  properties,  but  not  every  real  number,  since  in  either  case  escapes 
classification. 

There  are  now  two  possibilities.*  Either  L  has  a  greatest  member  Z,  or 
R  has  a  least  member  r.  Both  of  these  events  cannot  occur.  For  if  L  had 
a  greatest  member  l,  and  R  a  least  member  r,  the  number  |(Z  +  r)  would 
be  greater  than  all  members  of  L  and  less  than  all  members  of  R.  and  so 
could  not  belong  to  either  class.  On  the  other  hand  one  event  must  occur 7 

For  let  L\  and  R\  denote  the  classes  formed  from  L  and  R  by  taking 
only  the  rational  members  of  L  and  R.  Then  the  classes  Li  and  R\  form 
a  section  of  the  rational  numbers.  There  are  now  two  cases  to  distinguish. 

It  may  happen  that  Li  has  a  greatest  member  a.  In  this  case  a  must 
be  also  the  greatest  member  of  L.  For  if  not,  we  could  find  a  greater,  say  f3. 
There  are  rational  numbers  lying  between  a  and  /3,  and  these,  being  less 
than  (3,  belong  to  L,  and  therefore  to  Lr  and  this  is  plainly  a  contradiction. 
Hence  a  is  the  greatest  member  of  L. 

On  the  other  hand  it  may  happen  that  L\  has  no  greatest  member.  In 
this  case  the  section  of  the  rational  numbers  formed  by  Li  and  R\  is  a  real 
number  a.  This  number  a  must  belong  to  L  or  to  R.  If  it  belongs  to  L 
we  can  show,  precisely  as  before,  that  it  is  the  greatest  member  of  L ,  and 
similarly,  if  it  belongs  to  R ,  it  is  the  least  member  of  R. 

Thus  in  any  case  either  L  has  a  greatest  member  or  R  a  least.  Any 
section  of  the  real  numbers  therefore  ‘corresponds’  to  a  real  number  in 
the  sense  in  which  a  section  of  the  rational  numbers  sometimes,  but  not 
always,  corresponds  to  a  rational  number.  This  conclusion  is  of  very  great 
importance;  for  it  shows  that  the  consideration  of  sections  of  all  the  real 
numbers  does  not  lead  to  any  further  generalisation  of  our  idea  of  number. 
Starting  from  the  rational  numbers,  we  found  that  the  idea  of  a  section  of 
the  rational  numbers  led  us  to  a  new  conception  of  a  number,  that  of  a  real 
number,  more  general  than  that  of  a  rational  number;  and  it  might  have 


*There  were  three  in  §  6. 
^This  was  not  the  case  in  §  6. 
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been  expected  that  the  idea  of  a  section  of  the  real  numbers  would  have  led 
us  to  a  conception  more  general  still.  The  discussion  which  precedes  shows 
that  this  is  not  the  case,  and  that  the  aggregate  of  real  numbers,  or  the 
continuum,  has  a  kind  of  completeness  which  the  aggregate  of  the  rational 
numbers  lacked,  a  completeness  which  is  expressed  in  technical  language 
by  saying  that  the  continuum  is  closed. 

The  result  which  we  have  just  proved  may  be  stated  as  follows: 

Dedekind’s  Theorem.  If  the  real  numbers  are  divided  into  two 
classes  L  and  R  in  such  a  way  that 

(i)  every  number  belongs  to  one  or  other  of  the  two  classes, 

(ii)  each  class  contains  at  least  one  number, 

(iii)  any  member  of  L  is  less  than  any  member  of  R, 

then  there  is  a  number  a,  which  has  the  property  that  all  the  numbers  less 
than  it  belong  to  L  and  all  the  numbers  greater  than  it  to  R.  The  number  a 
itself  may  belong  to  either  class. 

In  applications  we  have  often  to  consider  sections  not  of  all  numbers  but 
of  all  those  contained  in  an  interval  [(5, 7],  that  is  to  say  of  all  numbers  x  such 
that  /3  ^  x  ^  7.  A  ‘section’  of  such  numbers  is  of  course  a  division  of  them  into 
two  classes  possessing  the  properties  (i),  (ii),  and  (iii).  Such  a  section  may  be 
converted  into  a  section  of  all  numbers  by  adding  to  L  all  numbers  less  than  /? 
and  to  R  all  numbers  greater  than  7.  It  is  clear  that  the  conclusion  stated  in 
Dedekind’s  Theorem  still  holds  if  we  substitute  ‘the  real  numbers  of  the  interval 
[/3, 7]’  for  ‘the  real  numbers’,  and  that  the  number  a  in  this  case  satisfies  the 
inequalities  f3  ^  a  ^  7. 

18.  Points  of  accumulation.  A  system  of  real  numbers,  or  of  the 
points  on  a  straight  line  corresponding  to  them,  defined  in  any  way  what¬ 
ever,  is  called  an  aggregate  or  set  of  numbers  or  points.  The  set  might 
consist,  for  example,  of  all  the  positive  integers,  or  of  all  the  rational  points. 

It  is  most  convenient  here  to  use  the  language  of  geometry.*  Suppose 

*The  reader  will  hardly  require  to  be  reminded  that  this  course  is  adopted  solely  for 
reasons  of  linguistic  convenience. 
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then  that  we  are  given  a  set  of  points,  which  we  will  denote  by  S.  Take 
any  point  £,  which  may  or  may  not  belong  to  S.  Then  there  are  two 
possibilities.  Either  (i)  it  is  possible  to  choose  a  positive  number  <5  so  that 
the  interval  [£  — 5,  £+5]  does  not  contain  any  point  of  S,  other  than  £  itself,* 
or  (ii)  this  is  not  possible. 

Suppose,  for  example,  that  S  consists  of  the  points  corresponding  to  all  the 
positive  integers.  If  £  is  itself  a  positive  integer,  we  can  take  5  to  be  any  number 
less  than  1,  and  (i)  will  be  true;  or,  if  £  is  halfway  between  two  positive  integers, 
we  can  take  5  to  be  any  number  less  than  1 .  On  the  other  hand,  if  S  consists  of 
all  the  rational  points,  then,  whatever  the  value  of  £,  (ii)  is  true;  for  any  interval 
whatever  contains  an  infinity  of  rational  points. 

Let  us  suppose  that  (ii)  is  true.  Then  any  interval  [£  —  S,  £  +  5],  however 
small  its  length,  contains  at  least  one  point  £i  which  belongs  to  S  and  does 
not  coincide  with  £;  and  this  whether  £  itself  be  a  member  of  S  or  not. 
In  this  case  we  shall  say  that  £  is  a  point  of  accumulation  of  S.  It  is 
easy  to  see  that  the  interval  [£  —  <5,  £  +  5]  must  contain,  not  merely  one,  but 
infinitely  many  points  of  S.  For,  when  we  have  determined  £i,  we  can  take 
an  interval  [£  —  <Si,  £  +  <5i]  surrounding  £  but  not  reaching  as  far  as  £i.  But 
this  interval  also  must  contain  a  point,  say  £2,  which  is  a  member  of  S  and 
does  not  coincide  with  £.  Obviously  we  may  repeat  this  argument,  with 
£2  in  the  place  of  £1;  and  so  on  indefinitely.  In  this  way  we  can  determine 
as  many  points 

£i>  £2^  £3^  •  •  • 

as  we  please,  all  belonging  to  S,  and  all  lying  inside  the  interval  [£  —  5,  £+5]. 

A  point  of  accumulation  of  S  may  or  may  not  be  itself  a  point  of  S. 
The  examples  which  follow  illustrate  the  various  possibilities. 


Examples  IX.  1.  If  S  consists  of  the  points  corresponding  to  the  posi¬ 
tive  integers,  or  all  the  integers,  there  are  no  points  of  accumulation. 

2.  If  S  consists  of  all  the  rational  points,  every  point  of  the  line  is  a  point 
of  accumulation. 

*This  clause  is  of  course  unnecessary  if  £  does  not  itself  belong  to  S. 
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3.  If  S  consists  of  the  points  1,  |, . . . ,  there  is  one  point  of  accumulation, 

viz.  the  origin. 

4.  If  S  consists  of  all  the  positive  rational  points,  the  points  of  accumulation 
are  the  origin  and  all  positive  points  of  the  line. 


19.  Weierstrass’s  Theorem.  The  general  theory  of  sets  of  points  is 
of  the  utmost  interest  and  importance  in  the  higher  branches  of  analysis; 
but  it  is  for  the  most  part  too  difficult  to  be  included  in  a  book  such  as 
this.  There  is  however  one  fundamental  theorem  which  is  easily  deduced 
from  Dedekind’s  Theorem  and  which  we  shall  require  later. 

Theorem.  If  a  set  S  contains  infinitely  many  points,  and  is  entirely 
situated  in  an  interval  [ot,fd\,  then  at  least  one  point  of  the  interval  is  a 
point  of  accumulation  of  S. 

We  divide  the  points  of  the  line  A  into  two  classes  in  the  following  man¬ 
ner.  The  point  P  belongs  to  L  if  there  are  an  infinity  of  points  of  S  to  the 
right  of  P,  and  to  R  in  the  contrary  case.  Then  it  is  evident  that  conditions 
(i)  and  (iii)  of  Dedekind’s  Theorem  are  satisfied;  and  since  a  belongs  to  L 
and  ft  to  R ,  condition  (ii)  is  satisfied  also. 

Hence  there  is  a  point  f  such  that,  however  small  be  5,  f  —  6  belongs 
to  L  and  f  +  6  to  R,  so  that  the  interval  [£  —  8,  £  +  5]  contains  an  infinity 
of  points  of  S.  Hence  f  is  a  point  of  accumulation  of  S. 

This  point  may  of  course  coincide  with  a  or  /3,  as  for  instance  when  a  =  0, 
(3  =  1,  and  S  consists  of  the  points  1,  |, . . . .  In  this  case  0  is  the  sole  point 

of  accumulation. 


MISCELLANEOUS  EXAMPLES  ON  CHAPTER  I. 


1.  What  are  the  conditions  that  ax  +  by  +  cz  =  0,  (1)  for  all  values  of  x, 
y,  z\  (2)  for  all  values  of  x,  y,  z  subject  to  ax  +  (3y  +  'yz  =  0;  (3)  for  all  values 
of  x,  y,  z  subject  to  both  ax  +  (3y  +  yz  =  0  and  Ax  +  By  +  Cz  =  0? 

2.  Any  positive  rational  number  can  be  expressed  in  one  and  only  one  way 
in  the  form 


ai  + 


_U2_ 
1  •  2 


+ 


«3 

1-2-3 


+  •••  + 


ak 


1-2-3  ...C 
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where  ai,  a,2,  ■  ■  ■ ,  ctfc  are  integers,  and 


0  ^  a\,  0  ^  CL2  <  2,  0  as  <  3, 


0  <  cik  <  k. 


3.  Any  positive  rational  number  can  be  expressed  in  one  and  one  way  only 
as  a  simple  continued  fraction 


where  ai,  02, . . .  are  positive  integers,  of  which  the  first  only  may  be  zero. 

[Accounts  of  the  theory  of  such  continued  fractions  will  be  found  in  text¬ 
books  of  algebra.  For  further  information  as  to  modes  of  representation  of 
rational  and  irrational  numbers,  see  Hobson,  Theory  of  Functions  of  a  Real 
Variable ,  pp.  45-49.] 

4.  Find  the  rational  roots  (if  any)  of  9.x3  —  6x2  +  15x  —  10  =  0. 

5.  A  line  AB  is  divided  at  C  in  aurea  sectione  (Euc.  II.  11) — i.e.  so  that 
AB  •  AC  =  BC2.  Show  that  the  ratio  AC  /AB  is  irrational. 

[A  direct  geometrical  proof  will  be  found  in  Bromwich’s  Infinite  Series,  §  143, 
p.  363.1 

r  j  (2^4.  — (—  l) 

6.  A  is  irrational.  In  what  circumstances  can  — - - ,  where  a,  b,  c,  d  are 

r  A  A-  d 

rational,  be  rational? 

7.  Some  elementary  inequalities.  In  what  follows  ai,  02,...  denote 
positive  numbers  (including  zero)  and  p,  q, . . .  positive  integers.  Since  a\  —  av2 
and  a\  —  a\  have  the  same  sign,  we  have  (a\  —  a%)(af  —  a2)  ^  0,  or 

a \+q  +  a2+q  ^  a\a\  +  a\a\,  (1) 

an  inequality  which  may  also  be  written  in  the  form 


V  1  P 

+  elf. 


g  1  g 
a\  +  a2 


By  repeated  application  of  this  formula  we  obtain 


p+g+r+... 


v+q+r+... 


P  1  P 
al  +  a2 


g  1  g 
a\  +  a2 


af  +  a2 
2 
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and  in  particular 

ai  +  a2  >  ^  ai  +  0,2  \  p 

When p  =  q  =  1  in  (1),  or  p  =  2  in  (4),  the  inequalities  are  merely  different  forms 
of  the  inequality  af  +  a|  ^  2aia2,  which  expresses  the  fact  that  the  arithmetic 
mean  of  two  positive  numbers  is  not  less  than  their  geometric  mean. 

8.  Generalisations  for  n  numbers.  If  we  write  down  the  \n(n  —  1) 
inequalities  of  the  type  (1)  which  can  be  formed  with  n  numbers  ai,  02,  •  •  • ,  an, 
and  add  the  results,  we  obtain  the  inequality 

nZap+q  ^  (5) 

or 

(J2a-+1)  In  a  {(£tf>)  /»}  {(£««)  /»}  •  (6) 

Hence  we  can  deduce  an  obvious  extension  of  (3)  which  the  reader  may  formulate 
for  himself,  and  in  particular  the  inequality 

(E«P)/n^  {(£«)/<•  (7) 

9.  The  general  form  of  the  theorem  concerning  the  arithmetic 
and  geometric  means.  An  inequality  of  a  slightly  different  character  is  that 
which  asserts  that  the  arithmetic  mean  of  ai,  a 2,  . . . ,  an  is  not  less  than  their 
geometric  mean.  Suppose  that  ar  and  as  are  the  greatest  and  least  of  the  a’s  (if 
there  are  several  greatest  or  least  a’s  we  may  choose  any  of  them  indifferently), 
and  let  G  be  their  geometric  mean.  We  may  suppose  G  >  0,  as  the  truth  of  the 
proposition  is  obvious  when  G  =  0.  If  now  we  replace  ar  and  as  by 

ar  —  G,  as  —  CLrds  /  G , 

we  do  not  alter  the  value  of  the  geometric  mean;  and,  since 

ar  +  as  —  ar  —  as  =  (ar  —  G)(as  —  G)/G  ^  0, 

we  certainly  do  not  increase  the  arithmetic  mean. 

It  is  clear  that  we  may  repeat  this  argument  until  we  have  replaced  each  of 
ai,  a2,  . . . ,  an  by  G;  at  most  n  repetitions  will  be  necessary.  As  the  final  value 
of  the  arithmetic  mean  is  G,  the  initial  value  cannot  have  been  less. 


(4) 
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10.  Schwarz’s  inequality.  Suppose  that  ai,  02,  . . . ,  an  and  b\,  b2,  ■  ■  ■ ,  bn 
are  any  two  sets  of  numbers  positive  or  negative.  It  is  easy  to  verify  the  identity 

(XXM2  =  EarEas  -  E(aA  ~  asbr )2, 

where  r  and  s  assume  the  values  1,2 ,  . . . ,  n.  It  follows  that 

(E arbrf  ^  EarE&r> 

an  inequality  usually  known  as  Schwarz’s  (though  due  originally  to  Cauchy). 

11.  If  ai,  a2,  . . . ,  an  are  all  positive,  and  sn  =  ai  +  02  +  •  •  •  +  an,  then 

„2  „n 

(1  +  oi)(l  +  af) . . .  (1  +  an )  fs  1  +  Sn.  +  +  •  -  *  +  • 

(Math.  Trip.  1909.) 

12.  If  ai,  d2,  ■  •  • ,  an  and  b\,  62,  •  •  • ,  bn  are  two  sets  of  positive  numbers, 
arranged  in  descending  order  of  magnitude,  then 

(<2l  +  02  +  '  '  '  H“  &n)(bl  4“  &2  +  ‘  '  '  ~t~  bn)  ^  Tl(a\b\  +  Q-2&2  +  '  '  '  +  dnbn )■ 

13.  If  a,  b,  c,  . . .  k  and  A,  B,  C,  . . .  K  are  two  sets  of  numbers,  and  all  of 
the  first  set  are  positive,  then 

a  A  +  bB  +  •  •  •  +  kK 
a  ~\~  b  k 

lies  between  the  algebraically  least  and  greatest  of  A,  B,  ... ,  K. 

14.  If  v/p,  yfq  are  dissimilar  surds,  and  a  +  byjp  +  c^/q  +  dy/pq  =  0,  where 
a,  b,  c,  d  are  rational,  then  a  =  0,  b  =  0,  c  =  0,  d  =  0. 

[Express  yfp  in  the  form  AI  +  N y/q,  where  AI  and  N  are  rational,  and  apply 
the  theorem  of  §  14.] 

15.  Show  that  if  ay/2  +  b\f/>  +  c\J 5  =  0,  where  a,  b ,  c  are  rational  numbers, 
then  a  =  0,  b  =  0,  c  =  0. 

16.  Any  polynomial  in  yfp  and  y/q,  with  rational  coefficients  (i.e.  any  sum  of 
a  finite  number  of  terms  of  the  form  A(yfp)m(y/q)n ,  where  m  and  n  are  integers, 
and  A  rational),  can  be  expressed  in  the  form 


a  +  by/p  +  Cy/q  +  dyfpq, 
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where  a,  b,  c,  d  are  rational. 


17.  Express 


a  +  by/p  +  c^fq 

d  +  ey/p  +  fy/q' 


where  a,  b,  etc.  are  rational,  in  the  form 


A  +  B  sfp  +  Cy/q  +  D  y/pq, 


where  A,  B ,  C,  D  are  rational. 

[Evidently 

a  +  b^p  +  Cy/q  _  (a  +  by/p  +  Cy/q)(d  +  e^/p  -  fy/q) 
d  +  e\/P  +  / \fd  (d  +  e^/p)2  -  f2q 

_  «  +  /3y/p  +  7  Vo_  +  d  sjyq 

e  +  C  y/P 

where  a,  (3,  etc.  are  rational  numbers  which  can  easily  be  found.  The  required 
reduction  may  now  be  easily  completed  by  multiplication  of  numerator  and 
denominator  by  e  —  C\/P-  For  example,  prove  that 

1  1  1  nr  1 

- 7= - 7=  —  x  +  7  V^2  —  7 

1  +  V2  +  V3  2  4  4 


18.  If  a,  6,  x,  y  are  rational  numbers  such  that 

(ay  —  bx )2  +  4  (a  —  x)(b  —  y)  =  0, 

then  either  (i)  x  =  a,  y  =  b  or  (ii)  1  —  ab  and  1  —  xy  are  squares  of  rational 
numbers.  (Math.  Trip.  1903.) 

19.  If  all  the  values  of  x  and  y  given  by 

ax 2  +  2  lixy  +  by 2  =  1,  ax2  +  2  h!xy  +  6'y2  =  1 
(where  a,  h,  b,  a' ,  /;/ ,  1/  are  rational)  are  rational,  then 

(/i  —  /i')2  —  (a  —  a/)(6  —  67),  (a£/  —  a'6)2  +  4(0^  —  ah)(bh!  —  b'h) 

are  both  squares  of  rational  numbers.  (Math.  Trip.  1899.) 

20.  Show  that  \J~2  and  y/3  are  cubic  functions  of  y/2  +  y/3,  with  rational 
coefficients,  and  that  \/2  —  \/6  +  3  is  the  ratio  of  two  linear  functions  of  \/2  +  \/3- 

(Math.  Trip.  1905.) 
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21.  The  expression 

\]  a  +  2m  ~\/ a  —  m^  +  y^a  —  2m  y/  a  —  m ^ 

is  equal  to  2 m  if  2m2  >  a  >  m2,  and  to  2  Va  —  nr2  if  a  >  2m2. 

22.  Show  that  any  polynomial  in  \/2,  with  rational  coefficients,  can  be  ex¬ 
pressed  in  the  form 

a  +  6  \/2  +  c\/4, 

where  a,  6,  c  are  rational. 

More  generally,  if  p  is  any  rational  number,  any  polynomial  in  with 
rational  coefficients  can  be  expressed  in  the  form 


a0  +  a\a  +  a2a2  H - b  am_iam  1 


where  a o,  ai, . . .  are  rational  and  a  =  ^/p.  For  any  such  polynomial  is  of  the 
form 

b0  +  b\a  +  &2cV  +  •  •  •  +  b^a^ , 

where  the  6’s  are  rational.  If  k  ^  m  —  1,  this  is  already  of  the  form  required.  If 
k  >  m  —  1,  let  ar  be  any  power  of  cc  higher  than  the  (m—  l)th.  Then  r  =  Am+s, 
where  A  is  an  integer  and  0  ^  s  ^  m  —  1;  and  ar  =  aAm+s  =  pxas.  Hence  we 
can  get  rid  of  all  powers  of  a  higher  than  the  (m  —  l)th. 

23.  Express  (s/2  —  l)5  and  (v^2  —  l)/(\/2  +  1)  in  the  form  a  +  b\/ 2  +  c\/4, 
where  a,  b,  c  are  rational.  [Multiply  numerator  and  denominator  of  the  second 
expression  by  —  \/2  +  1 .] 

24.  If 

a  +  6\/2  +  c\f\  =  0, 

where  a,  b,  c  are  rational,  then  a  =  0,  b  =  0,  c  =  0. 

[Let  y  =  \K2.  Then  y3  =  2  and 


cy2  +  by  +  a  =  0. 


Hence  2 cy2  +  2  by  +  ay3  =  0  or 

ay 2  +  2  cy  +  26  =  0. 


Multiplying  these  two  quadratic  equations  by  a  and  c  and  subtracting,  we 
obtain  (a6— 2c2)y+a2— 26c  =  0,  or  y  =  —  (a2  —  2bc)/{ab— 2c2),  a  rational  number, 
which  is  impossible.  The  only  alternative  is  that  ab  —  2 c2  =  0,  a2  —  26c  =  0. 
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Hence  ab  =  2c2,  a4  =  4b2  c2.  If  neither  a  nor  b  is  zero,  we  can  divide  the 
second  equation  by  the  first,  which  gives  a3  =  2 b3:  and  this  is  impossible,  since 
\/2  cannot  be  equal  to  the  rational  number  a/b.  Hence  ab  =  0,  c  =  0,  and  it 
follows  from  the  original  equation  that  a,  b,  and  c  are  all  zero. 

As  a  corollary,  if  a  +  2  +  c\J 4  =  d  +  e\/2  +  f  \J 4,  then  a  =  d,b  =  e:c  =  f. 

It  may  be  proved,  more  generally,  that  if 

a0  +  ai//™  +  •  •  •  +  =  0, 

p  not  being  a  perfect  mth  power,  then  oq  =  a\  =  ■  ■  ■  =  am_i  =  0;  but  the  proof 
is  less  simple.] 

25.  If  A  +  \/l3  =  C  +  \f  D,  then  either  A  =  C,  B  =  D,  or  B  and  D  are  both 
cubes  of  rational  numbers. 

26.  If  A  +  \J B  +  ?/C  =  0,  then  either  one  of  A,  B,  C  is  zero,  and  the 
other  two  equal  and  opposite,  or  \/A,  \fB,  yfC  are  rational  multiples  of  the 
same  surd  \[X . 

27.  Find  rational  numbers  a,  /3  such  that 

\J  7  +  5^2  =  a  +  py/2. 

28.  If  (a  —  b3)b  >  0,  then 


963  +  a  la  —  b3 
3  b  V  3  b 

is  rational.  [Each  of  the  numbers  under  a  cube  root  is  of  the  form 


a  + 


9  b3  + ' 


3  b 


—  b3 


3  b 


+ 


where  a  and  (3  are  rational.] 

29.  If  a  =  ^/p,  any  polynomial  in  a  is  the  root  of  an  equation  of  degree  n, 
with  rational  coefficients. 

[We  can  express  the  polynomial  (x  say)  in  the  form 

O-i) 

1 


x  =  h  +  mi  a  +  •  •  •  +  ria1 
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where  l\,  mi, . . .  are  rational,  as  in  Ex.  22. 
Similarly 


x2  =  h  +  m2a  +  . . .  +  r2a^n 


xn  =  ln  +  mna  +  . . .  +  rna^n  . 


Hence 

L\x  +  L2x2  +  •  •  •  +  Lnxn  =  A, 
where  A  is  the  determinant 


l\  m\  ...  r\ 
l2  m2  ...  r2 


^Yln  •  •  ■  t'n 

and  L\,  L2, . . .  the  minors  of  l\,  l2, . . . .] 

30.  Apply  this  process  to  x  =  p  +  yj q ,  and  deduce  the  theorem  of  §  14. 

31.  Show  that  y  =  a  +  bp1/3  +  cp 2/3  satisfies  the  equation 

y3  —  3  ay2  +  2>y{a2  —  bcp)  —  a3  —  b3p  —  c3p 2  +  3  abcp  =  0. 

32.  Algebraical  numbers.  We  have  seen  that  some  irrational  numbers 
(such  as  \/2)  are  roots  of  equations  of  the  type 

CL ()Xn  +  CL\Xn  +  '  '  '  +  CLn  =  0, 

where  a-o,  a\,  . . . ,  an  are  integers.  Such  irrational  numbers  are  called  algebraical 
numbers:  all  other  irrational  numbers,  such  as  n  (§  15),  are  called  transcendental 
numbers.  Show  that  if  x  is  an  algebraical  number,  then  so  are  kx ,  where  k  is 
any  rational  number,  and  xm^n,  where  rn  and  n  are  any  integers. 

33.  If  x  and  y  are  algebraical  numbers,  then  so  are  x  +  y,  x  —  y,  xy  and  x/y. 
[We  have  equations 

aoxm  +  a\xn>  +  . . .  +  am  =  0, 
bo yn  +  biyn  1  +  . . .  +  bn  =0, 
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where  the  a’s  and  b's  are  integers.  Write  x  +  y  =  z,  y  =  z  —  x  in  the  second,  and 
eliminate  x.  We  thus  get  an  equation  of  similar  form 

c0zp  +  azP-1  H - 1-  cp  =  0, 

satisfied  by  z.  Similarly  for  the  other  cases.] 

34.  If 

C lQXri  +  (l\Xn  +  '  '  '  +  CLn  =  0, 

where  ao,  ai,  ...,  an  are  any  algebraical  numbers,  then  x  is  an  algebraical 
number.  [We  have  n  +  1  equations  of  the  type 

a0)ra™r  +  ai)ra™r_1  H - f-  amr,r  =  0  (r  =  0,  1,  ... ,  n), 

in  which  the  coefficients  ao,r,  «i,r,  •  •  •  are  integers.  Eliminate  ao,  ai,  ...,  an 
between  these  and  the  original  equation  for  x.\ 

35.  Apply  this  process  to  the  equation  x 2  —  2x\J~2  +  \f?>  =  0. 

[The  result  is  x8  —  16x6  +  58x4  —  48x2  +  9  =  0.] 

36.  Find  equations,  with  rational  coefficients,  satisfied  by 

1  +  V2  +  V3,  4|±4I,  ^2+^3. 

\/3  —  v  2 

37.  If  x’3  =  x  +  1,  then  x3n  =  anx  +  bn  +  cn/x ,  where 

®n+l  —  +  bn ,  &n+l  —  +  bn  +  Cn,  Cn_|_i  —  On  +  Cn. 

38.  If  x6  +  x5  —  2x4  —  x3  +  x2  + 1  =  0  and  y  =  x4  —  x2  +  x  —  1,  then  y  satisfies 

a  quadratic  equation  with  rational  coefficients.  (Math.  Trip.  1903.) 

[It  will  be  found  that  y2  +  y  +  1  =  0.] 


CHAPTER  II 


FUNCTIONS  OF  REAL  VARIABLES 

20.  The  idea  of  a  function.  Suppose  that  x  and  y  are  two  contin¬ 
uous  real  variables,  which  we  may  suppose  to  be  represented  geometrically 
by  distances  AqP  =  x,  B0Q  =  y  measured  from  fixed  points  A0,  B0  along 
two  straight  lines  A,  M.  And  let  us  suppose  that  the  positions  of  the  points 
P  and  Q  are  not  independent,  but  connected  by  a  relation  which  we  can 
imagine  to  be  expressed  as  a  relation  between  x  and  y:  so  that,  when 
P  and  x  are  known,  Q  and  y  are  also  known.  We  might,  for  example, 
suppose  that  y  =  x,  or  y  =  2x,  or  |x,  or  x2  +  1.  In  all  of  these  cases 
the  value  of  x  determines  that  of  y.  Or  again,  we  might  suppose  that  the 
relation  between  x  and  y  is  given,  not  by  means  of  an  explicit  formula  for  y 
in  terms  of  x,  but  by  means  of  a  geometrical  construction  which  enables 
us  to  determine  Q  when  P  is  known. 

In  these  circumstances  y  is  said  to  be  a  function  of  x.  This  notion  of 
functional  dependence  of  one  variable  upon  another  is  perhaps  the  most 
important  in  the  whole  range  of  higher  mathematics.  In  order  to  enable 
the  reader  to  be  certain  that  he  understands  it  clearly,  we  shall,  in  this 
chapter,  illustrate  it  by  means  of  a  large  number  of  examples. 

But  before  we  proceed  to  do  this,  we  must  point  out  that  the  simple 
examples  of  functions  mentioned  above  possess  three  characteristics  which 
are  by  no  means  involved  in  the  general  idea  of  a  function,  viz.: 

(1)  y  is  determined  for  every  value  of  x; 

(2)  to  each  value  of  x  for  which  y  is  given  corresponds  one  and  only 
one  value  of  y; 

(3)  the  relation  between  x  and  y  is  expressed  by  means  of  an  analytical 
formula,  from  which  the  value  of  y  corresponding  to  a  given  value  of  x  can 
be  calculated  by  direct  substitution  of  the  latter. 

It  is  indeed  the  case  that  these  particular  characteristics  are  possessed 
by  many  of  the  most  important  functions.  But  the  consideration  of  the 
following  examples  will  make  it  clear  that  they  are  by  no  means  essential 
to  a  function.  All  that  is  essential  is  that  there  should  be  some  relation 
between  x  and  y  such  that  to  some  values  of  x  at  any  rate  correspond 
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values  of  y. 

Examples  X.  1.  Let  y  =  x  or  2x  or  ^ x  or  x2  +  1.  Nothing  further  need 
be  said  at  present  about  cases  such  as  these. 

2.  Let  y  =  0  whatever  be  the  value  of  x.  Then  y  is  a  function  of  x,  for  we 
can  give  x  any  value,  and  the  corresponding  value  of  y  (viz.  0)  is  known.  In  this 
case  the  functional  relation  makes  the  same  value  of  y  correspond  to  all  values 
of  x.  The  same  would  be  true  were  y  equal  to  1  or  —  |  or  \[2  instead  of  0.  Such 
a  function  of  x  is  called  a  constant. 

3.  Let  y2  =  x.  Then  if  x  is  positive  this  equation  defines  two  values  of  y 
corresponding  to  each  value  of  x,  viz.  ±s/x.  If  x  =  0,  y  =  0.  Hence  to  the 
particular  value  0  of  x  corresponds  one  and  only  one  value  of  y.  But  if  x  is 
negative  there  is  no  value  of  y  which  satisfies  the  equation.  That  is  to  say, 
the  function  y  is  not  defined  for  negative  values  of  x.  This  function  therefore 
possesses  the  characteristic  (3),  but  neither  (1)  nor  (2). 

4.  Consider  a  volume  of  gas  maintained  at  a  constant  temperature  and 
contained  in  a  cylinder  closed  by  a  sliding  piston.* 

Let  A  be  the  area  of  the  cross  section  of  the  piston  and  W  its  weight.  The 
gas,  held  in  a  state  of  compression  by  the  piston,  exerts  a  certain  pressure  po  per 
unit  of  area  on  the  piston,  which  balances  the  weight  W,  so  that 

W  =  APo. 

Let  Vo  be  the  volume  of  the  gas  when  the  system  is  thus  in  equilibrium.  If 
additional  weight  is  placed  upon  the  piston  the  latter  is  forced  downwards.  The 
volume  (y)  of  the  gas  diminishes;  the  pressure  (p)  which  it  exerts  upon  unit 
area  of  the  piston  increases.  Boyle’s  experimental  law  asserts  that  the  product 
of  p  and  v  is  very  nearly  constant,  a  correspondence  which,  if  exact,  would  be 
represented  by  an  equation  of  the  type 


pv  =  a,  (i) 

where  a  is  a  number  which  can  be  determined  approximately  by  experiment. 

Boyle’s  law,  however,  only  gives  a  reasonable  approximation  to  the  facts  pro¬ 
vided  the  gas  is  not  compressed  too  much.  When  v  is  decreased  and  p  increased 

*1  borrow  this  instructive  example  from  Prof.  H.  S.  Carslaw’s  Introduction  to  the 
Calculus. 
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beyond  a  certain  point,  the  relation  between  them  is  no  longer  expressed  with 
tolerable  exactness  by  the  equation  (i).  It  is  known  that  a  much  better  approx¬ 
imation  to  the  true  relation  can  then  be  found  by  means  of  what  is  known  as 
‘van  der  Waals’  law’,  expressed  by  the  equation 

(p  +  ^2)  0  -  P)  =  7,  (ii) 

where  a ,  /3,  7  are  numbers  which  can  also  be  determined  approximately  by 
experiment. 

Of  course  the  two  equations,  even  taken  together,  do  not  give  anything  like 
a  complete  account  of  the  relation  between  p  and  v.  This  relation  is  no  doubt 
in  reality  much  more  complicated,  and  its  form  changes,  as  v  varies,  from  a 
form  nearly  equivalent  to  (i)  to  a  form  nearly  equivalent  to  (ii).  But,  from  a 
mathematical  point  of  view,  there  is  nothing  to  prevent  us  from  contemplating 
an  ideal  state  of  things  in  which,  for  all  values  of  v  not  less  than  a  certain 
value  V,  (i)  would  be  exactly  true,  and  (ii)  exactly  true  for  all  values  of  v  less 
than  V .  And  then  we  might  regard  the  two  equations  as  together  defining  p 
as  a  function  of  v.  It  is  an  example  of  a  function  which  for  some  values  of  v  is 
defined  by  one  formula  and  for  other  values  of  v  is  defined  by  another. 

This  function  possesses  the  characteristic  (2);  to  any  value  of  v  only  one 
value  of  p  corresponds:  but  it  does  not  possess  (1).  For  p  is  not  defined  as  a 
function  of  v  for  negative  values  of  v,  a  ‘negative  volume’  means  nothing,  and 
so  negative  values  of  v  do  not  present  themselves  for  consideration  at  all. 

5.  Suppose  that  a  perfectly  elastic  ball  is  dropped  (without  rotation)  from 
a  height  \gr2  on  to  a  fixed  horizontal  plane,  and  rebounds  continually. 

The  ordinary  formulae  of  elementary  dynamics,  with  which  the  reader  is 
probably  familiar,  show  that  h  =  \gt 2  if  0  ^  t  ^  r,  h  =  \g(2r  —  t)2  if  t  ^  ^  3 r, 

and  generally 

h  =  \ g(2nr  —  t )2 

if  (2 n  —  1  )t  S  t  =  (2n  +  l)r,  h  being  the  depth  of  the  ball,  at  time  t,  below  its 
original  position.  Obviously  h  is  a  function  of  t  which  is  only  defined  for  positive 
values  of  t. 

6.  Suppose  that  y  is  defined  as  being  the  largest  prime  factor  of  x.  This 
is  an  instance  of  a  definition  which  only  applies  to  a  particular  class  of  values 
of  x ,  viz.  integral  values.  ‘The  largest  prime  factor  of  -y  or  of  \J~2  or  of  7 r’  means 
nothing,  and  so  our  defining  relation  fails  to  define  for  such  values  of  x  as  these. 
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Thus  this  function  does  not  possess  the  characteristic  (1).  It  does  possess  (2), 
but  not  (3),  as  there  is  no  simple  formula  which  expresses  y  in  terms  of  x. 

7.  Let  y  be  defined  as  the  denominator  of  x  when  x  is  expressed  in  its 
lowest  terms.  This  is  an  example  of  a  function  which  is  defined  if  and  only  if 
x  is  rational.  Thus  y  =  7  if  x  =  — 11/7:  but  y  is  not  defined  for  x  =  \/2,  ‘the 
denominator  of  \[T  being  a  meaningless  form  of  words. 

8.  Let  y  be  defined  as  the  height  in  inches  of  policeman  Cx,  in  the 
Metropolitan  Police,  at  5.30  P.M.  on  8  Aug.  1907.  Then  y  is  defined  for  a 
certain  number  of  integral  values  of  x,  viz.  1,  2,  . . . ,  N,  where  N  is  the  total 
number  of  policemen  in  division  C  at  that  particular  moment  of  time. 

21.  The  graphical  representation  of  functions.  Suppose  that 
the  variable  y  is  a  function  of  the  variable  x.  It  will  generally  be  open  to 
us  also  to  regard  x  as  a  function  of  y,  in  virtue  of  the  functional  relation 
between  x  and  y.  But  for  the  present  we  shall  look  at  this  relation  from 
the  first  point  of  view.  We  shall  then  call  x  the  independent  variable  and  y 
the  dependent  variable ;  and,  when  the  particular  form  of  the  functional 
relation  is  not  specified,  we  shall  express  it  by  writing 

V  =  f(x) 

(or  F(x),  (p(x),  if(x), . . . ,  as  the  case  may  be). 

The  nature  of  particular  functions  may,  in  very  many  cases,  be  illus¬ 
trated  and  made  easily  intelligible  as  follows.  Draw  two  lines  OX,  OY  at 
right  angles  to  one  another  and  produced  indefinitely  in  both  directions. 
We  can  represent  values  of  x  and  y  by  distances  measured  from  O  along 
the  lines  OX,  OY  respectively,  regard  being  paid,  of  course,  to  sign,  and 
the  positive  directions  of  measurement  being  those  indicated  by  arrows  in 
Fig.  6. 

Let  a  be  any  value  of  x  for  which  y  is  defined  and  has  (let  us  suppose)  the 
single  value  b.  Take  OA  =  a,  OB  =  b,  and  complete  the  rectangle  OAPB. 
Imagine  the  point  P  marked  on  the  diagram.  This  marking  of  the  point  P 
may  be  regarded  as  showing  that  the  value  of  y  for  x  =  a  is  b. 

If  to  the  value  a  of  i  correspond  several  values  of  y  (say  b,  b' ,  b "),  we 
have,  instead  of  the  single  point  P,  a  number  of  points  P,  P' ,  P" . 
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Y 


A  X 


Fig.  6. 


We  shall  call  P  the  point  (a,  6);  a  and  b  the  coordinates  of  P  referred 
to  the  axes  OX,  OY;  a  the  abscissa ,  b  the  ordinate  of  P;  OX  and  OY  the 
axis  of  x  and  the  axis  of  y,  or  together  the  axes  of  coordinates ,  and  O  the 
origin  of  coordinates ,  or  simply  the  origin. 

Let  us  now  suppose  that  for  all  values  a  of  x  for  which  y  is  defined, 
the  value  b  (or  values  b ,  b',  b" ,  ...)ofy,  and  the  corresponding  point  P  (or 
points  P,  P',  P" , . . . ),  have  been  determined.  We  call  the  aggregate  of  all 
these  points  the  graph  of  the  function  y. 

To  take  a  very  simple  example,  suppose  that  y  is  defined  as  a  function 
of  x  by  the  equation 

Ax  +  By  +  C  =  0,  (1) 

where  A,  P,  C  are  any  fixed  numbers.*  Then  y  is  a  function  of  x  which 
possesses  all  the  characteristics  (1),  (2),  (3)  of  §  20.  It  is  easy  to  show  that 
the  graph  of  y  is  a  straight  line.  The  reader  is  in  all  probability  familiar 
with  one  or  other  of  the  various  proofs  of  this  proposition  which  are  given 
in  text-books  of  Analytical  Geometry. 

We  shall  sometimes  use  another  mode  of  expression.  We  shall  say  that 

*If  B  =  0,  y  does  not  occur  in  the  equation.  We  must  then  regard  y  as  a  function 
of  x  defined  for  one  value  only  of  x,  viz.  x  =  —C/A,  and  then  having  all  values. 
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when  x  and  y  vary  in  such  a  way  that  equation  (1)  is  always  true,  the  locus 
of  the  point  (x,y)  is  a  straight  line,  and  we  shall  call  (1)  the  equation  of 
the  locus,  and  say  that  the  equation  represents  the  locus.  This  use  of  the 
terms  ‘locus’,  ‘equation  of  the  locus’  is  quite  general,  and  may  be  applied 
whenever  the  relation  between  x  and  y  is  capable  of  being  represented  by 
an  analytical  formula. 

The  equation  Ax  +  By  +  C  =  0  is  the  general  equation  of  the  first  degree, 
for  Ax  +  By  +  C  is  the  most  general  polynomial  in  x  and  y  which  does 
not  involve  any  terms  of  degree  higher  than  the  first  in  x  and  y.  Hence  the 
general  equation  of  the  first  degree  represents  a  straight  line.  It  is  equally 
easy  to  prove  the  converse  proposition  that  the  equation  of  any  straight 
line  is  of  the  first  degree. 

We  may  mention  a  few  further  examples  of  interesting  geometrical  loci 
defined  by  equations.  An  equation  of  the  form 

(*  -  «)2  +  {y-  P?  =  p\ 

or 

x2  +  y2  +  2  Gx  +  2  Fy  +  C  =  0, 
where  G2  +  F2  —  C  >  0,  represents  a  circle.  The  equation 

Ax2  +  2  Hxy  +  By 2  +  2  Gx  +  2  Fy  +  C  =  0 

(the  general  equation  of  the  second  degree )  represents,  assuming  that  the 
coefficients  satisfy  certain  inequalities,  a  conic  section,  i.e.  an  ellipse, 
parabola,  or  hyperbola.  For  further  discussion  of  these  loci  we  must  refer 
to  books  on  Analytical  Geometry. 

22.  Polar  coordinates.  In  what  precedes  we  have  determined  the 
position  of  P  by  the  lengths  of  its  coordinates  OM  —  x,  MP  —  y.  If 
OP  =  r  and  M OP  =  9,6  being  an  angle  between  0  and  27r  (measured  in 
the  positive  direction),  it  is  evident  that 

x  =  r  cos  9,  y  =  r  sin  9, 
r  =  \J x2  +  y2,  cos  9  :  sin 9  :  1  ::  x  :  y  :  r, 
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and  that  the  position  of  P  is  equally  well  determined  by  a  knowledge  of 
r  and  9.  We  call  r  and  9  the  polar  coordinates  of  P.  The  former,  it  should 
be  observed,  is  essentially  positive.* 


If  P  moves  on  a  locus  there  will  be  some  relation  between  r  and  0,  say 
r  =  f(9)  or  9  =  F(r).  This  we  call  the  polar  equation  of  the  locus.  The 
polar  equation  may  be  deduced  from  the  (x,  y )  equation  (or  vice  versa)  by 
means  of  the  formulae  above. 

Thus  the  polar  equation  of  a  straight  line  is  of  the  form 

r  cos {9  —  a)  =  p, 

where  p  and  a  are  constants.  The  equation  r  =  2a  cos  9  represents  a  circle 
passing  through  the  origin;  and  the  general  equation  of  a  circle  is  of  the 
form 

r2  +  c2  —  2rccos(9  —  a)  =  A2, 
where  A,  c,  and  a  are  constants. 

*  Polar  coordinates  are  sometimes  defined  so  that  r  may  be  positive  or  negative.  In 
this  case  two  pairs  of  coordinates — e.g.  (1, 0)  and  (—1, 7r) — correspond  to  the  same  point. 
The  distinction  between  the  two  systems  may  be  illustrated  by  means  of  the  equation 
l/r  =  1  —  ecosR  where  l  >  0,  e  >  1.  According  to  our  definitions  r  must  be  positive 
and  therefore  cos  6  <  1/e:  the  equation  represents  one  branch  only  of  a  hyperbola,  the 
other  having  the  equation  —l/r=  1  —  ecos  9.  With  the  system  of  coordinates  which 
admits  negative  values  of  r,  the  equation  represents  the  whole  hyperbola. 
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23.  Further  examples  of  functions  and  their  graphical  rep¬ 
resentation.  The  examples  which  follow  will  give  the  reader  a  better 
notion  of  the  infinite  variety  of  possible  types  of  functions. 

A.  Polynomials.  A  polynomial  in  x  is  a  function  of  the  form 


aoxm  +  alXm  +  ■  ■  ■  + 


where  do,  a i,  am  are  constants.  The  simplest  polynomials  are  the 

simple  powers  y  =  x,  x2,  x3,  . . . ,  xm, ....  The  graph  of  the  function  xm  is 
of  two  distinct  types,  according  as  m  is  even  or  odd. 

First  let  m  =  2.  Then  three  points  on  the  graph  are  (0,0),  (1,1), 
(— 1, 1).  Any  number  of  additional  points  on  the  graph  may  be  found  by 
assigning  other  special  values  to  x:  thus  the  values 


l 

2’ 


2,  3, 


-2,  3 


give 


If  the  reader  will  plot  off  a  fair  number  of  points  on  the  graph,  he  will  be 
led  to  conjecture  that  the  form  of  the  graph  is  something  like  that  shown  in 
Fig.  8.  If  he  draws  a  curve  through  the  special  points  which  he  has  proved 
to  lie  on  the  graph  and  then  tests  its  accuracy  by  giving  x  new  values,  and 
calculating  the  corresponding  values  of  y,  he  will  find  that  they  lie  as  near 
to  the  curve  as  it  is  reasonable  to  expect,  when  the  inevitable  inaccuracies 
of  drawing  are  considered.  The  curve  is  of  course  a  parabola. 

There  is,  however,  one  fundamental  question  which  we  cannot  answer 
adequately  at  present.  The  reader  has  no  doubt  some  notion  as  to  what 
is  meant  by  a  continuous  curve,  a  curve  without  breaks  or  jumps;  such  a 
curve,  in  fact,  as  is  roughly  represented  in  Fig.  8.  The  question  is  whether 
the  graph  of  the  function  y  =  x2  is  in  fact  such  a  curve.  This  cannot 
be  proved  by  merely  constructing  any  number  of  isolated  points  on  the 
curve,  although  the  more  such  points  we  construct  the  more  probable  it 
will  appear. 
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Fig.  8. 


This  question  cannot  be  discussed  properly  until  Ch.  V.  In  that  chapter 
we  shall  consider  in  detail  what  our  common  sense  idea  of  continuity  really 
means,  and  how  we  can  prove  that  such  graphs  as  the  one  now  considered, 
and  others  which  we  shall  consider  later  on  in  this  chapter,  are  really 
continuous  curves.  For  the  present  the  reader  may  be  content  to  draw  his 
curves  as  common  sense  dictates. 

It  is  easy  to  see  that  the  curve  y  =  x2  is  everywhere  convex  to  the  axis  of  x. 
Let  Po >  Pi  (Fig.  8)  be  the  points  (xq,Xq),  (xi,x2).  Then  the  coordinates  of  a 
point  on  the  chord  PoPi  are  x  =  Xxo  +  fix\,  y  =  Ax[j  +  yx2.  where  A  and  y  are 
positive  numbers  whose  sum  is  1.  And 

y  -  x2  =  (A  +  r)(Axq  +  yx\)  -  (Ax0  +  yx\)2  =  \y(x\  -  x0)2  ^  0, 

so  that  the  chord  lies  entirely  above  the  curve. 

The  curve  y  =  xA  is  similar  to  y  =  x2  in  general  appearance,  but  flatter 
near  O,  and  steeper  beyond  the  points  A,  A'  (Fig.  9),  and  y  =  xm,  where 
m  is  even  and  greater  than  4,  is  still  more  so.  As  m  gets  larger  and  larger 
the  flatness  and  steepness  grow  more  and  more  pronounced,  until  the  curve 
is  practically  indistinguishable  from  the  thick  line  in  the  figure. 

The  reader  should  next  consider  the  curves  given  by  y  —  xm ,  when  m  is 
odd.  The  fundamental  difference  between  the  two  cases  is  that  whereas 
when  m  is  even  (— x)m  =  xm,  so  that  the  curve  is  symmetrical  about  OY, 
when  m  is  odd  (— x)m  =  —  xm,  so  that  y  is  negative  when  x  is  negative. 
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Fig.  10. 


Fig.  10  shows  the  curves  y  =  x,  y  =  x 3,  and  the  form  to  which  y  =  xm 
approximates  for  larger  odd  values  of  m. 

It  is  now  easy  to  see  how  (theoretically  at  any  rate)  the  graph  of  any 
polynomial  may  be  constructed.  In  the  first  place,  from  the  graph  of  y  =  xm 
we  can  at  once  derive  that  of  Cxm ,  where  C  is  a  constant,  by  multiplying 
the  ordinate  of  every  point  of  the  curve  by  C.  And  if  we  know  the  graphs 
of  f(x)  and  F(x),  we  can  find  that  of  f(x)  +  F{x)  by  taking  the  ordinate 
of  every  point  to  be  the  sum  of  the  ordinates  of  the  corresponding  points 
on  the  two  original  curves. 

The  drawing  of  graphs  of  polynomials  is  however  so  much  facilitated  by 
the  use  of  more  advanced  methods,  which  will  be  explained  later  on,  that 
we  shall  not  pursue  the  subject  further  here. 


Examples  XI.  1.  Trace  the  curves  y  =  7x4,  y  =  3x5,  y  =  x10. 

[The  reader  should  draw  the  curves  carefully,  and  all  three  should  be  drawn 
in  one  figure.*  He  will  then  realise  how  rapidly  the  higher  powers  of  x  increase, 

*It  will  be  found  convenient  to  take  the  scale  of  measurement  along  the  axis  of  y  a 
good  deal  smaller  than  that  along  the  axis  of  a;,  in  order  to  prevent  the  figure  becoming 
of  an  awkward  size. 
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as  x  gets  larger  and  larger,  and  will  see  that,  in  such  a  polynomial  as 

x10  +  3x5  +  7x4 

(or  even  x10  +  30x5  +  700x4),  it  is  the  first  term  which  is  of  really  preponderant 
importance  when  x  is  fairly  large.  Thus  even  when  x  =  4,  x10  >  1,000,000, 
while  30x5  <  35,000  and  700x4  <  180,000;  while  if  x  =  10  the  preponderance  of 
the  first  term  is  still  more  marked.] 

2.  Compare  the  relative  magnitudes  of  x12,  l,000,000x6,  1,000, 000, 000, OOOx 
when  x  =  1,  10,  100,  etc. 

[The  reader  should  make  up  a  number  of  examples  of  this  type  for  himself. 
This  idea  of  the  relative  rate  of  growth  of  different  functions  of  x  is  one  with 
which  we  shall  often  be  concerned  in  the  following  chapters.] 

3.  Draw  the  graph  of  ax 2  +  2 bx  +  c. 

[Here  y  —  {( ac  —  b2)/a}  =  a{x  +  ( b/a )}2.  If  we  take  new  axes  parallel  to  the 
old  and  passing  through  the  point  x  =  —b/a,  y  =  (ac  —  b2)/a ,  the  new  equation 
is  y'  =  ax' 2 .  The  curve  is  a  parabola.] 

4.  Trace  the  curves  y  =  x3  —  3x  +  1,  y  =  x2(x  —  1),  y  =  x(x  —  l)2. 


24.  B.  Rational  Functions.  The  class  of  functions  which  ranks 
next  to  that  of  polynomials  in  simplicity  and  importance  is  that  of  rational 
functions.  A  rational  function  is  the  quotient  of  one  polynomial  by  another: 
thus  if  P(x),  Q(x)  are  polynomials,  we  may  denote  the  general  rational 
function  by 


R(x) 


P(x) 
Q(x) ' 


In  the  particular  case  when  Q(x)  reduces  to  unity  or  any  other  constant 
(i.e.  does  not  involve  x),  R(x )  reduces  to  a  polynomial:  thus  the  class  of 
rational  functions  includes  that  of  polynomials  as  a  sub-class.  The  following 
points  concerning  the  definition  should  be  noticed. 

(1)  We  usually  suppose  that  P(x)  and  Q(x)  have  no  common  factor  x  +  a 
or  xp  +  axp~l  +  bxp~2  +  •  •  •  +  k,  all  such  factors  being  removed  by  division. 

(2)  It  should  however  be  observed  that  this  removal  of  common  factors  does 
as  a  rule  change  the  function.  Consider  for  example  the  function  x/x,  which  is  a 
rational  function.  On  removing  the  common  factor  x  we  obtain  1/1  =  1.  But  the 
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original  function  is  not  always  equal  to  1:  it  is  equal  to  1  only  so  long  asr/O. 
If  x  =  0  it  takes  the  form  0/0,  which  is  meaningless.  Thus  the  function  x/x  is 
equal  to  1  if  x  0  and  is  undefined  when  x  =  0.  It  therefore  differs  from  the 
function  1,  which  is  always  equal  to  1. 

(3)  Such  a  function  as 

(x  +  l^x  —  l)/  ( x  x  —  2 

may  be  reduced,  by  the  ordinary  rules  of  algebra,  to  the  form 

x2(x  —  2) 

(x  —  l)2(x  +  1)  ’ 

which  is  a  rational  function  of  the  standard  form.  But  here  again  it  must  be 
noticed  that  the  reduction  is  not  always  legitimate.  In  order  to  calculate  the 
value  of  a  function  for  a  given  value  of  x  we  must  substitute  the  value  for  x  in 
the  function  in  the  form  in  which  it  is  given.  In  the  case  of  this  function  the 
values  x  =  —1,  1,  0,  2  all  lead  to  a  meaningless  expression,  and  so  the  function 
is  not  defined  for  these  values.  The  same  is  true  of  the  reduced  form,  so  far  as 
the  values  —1  and  1  are  concerned.  But  x  =  0  and  x  =  2  give  the  value  0.  Thus 
once  more  the  two  functions  are  not  the  same. 

(4)  But,  as  appears  from  the  particular  example  considered  under  (3),  there 
will  generally  be  a  certain  number  of  values  of  x  for  which  the  function  is  not 
defined  even  when  it  has  been  reduced  to  a  rational  function  of  the  standard 
form.  These  are  the  values  of  x  (if  any)  for  which  the  denominator  vanishes. 
Thus  (x2  —  7)/(x2  —  3x  +  2)  is  not  defined  when  x  =  1  or  2. 

(5)  Generally  we  agree,  in  dealing  with  expressions  such  as  those  considered 
in  (2)  and  (3),  to  disregard  the  exceptional  values  of  x  for  which  such  processes 
of  simplification  as  were  used  there  are  illegitimate,  and  to  reduce  our  function 
to  the  standard  form  of  rational  function.  The  reader  will  easily  verify  that  (on 
this  understanding)  the  sum,  product,  or  quotient  of  two  rational  functions  may 
themselves  be  reduced  to  rational  functions  of  the  standard  type.  And  generally 
a  rational  function  of  a  rational  function  is  itself  a  rational  function:  i.e.  if  in 
z  =  P(y)/Q(y),  where  P  and  Q  are  polynomials,  we  substitute  y  =  Pi(x)/Qi(x), 
we  obtain  on  simplification  an  equation  of  the  form  z  =  P-2(x)/Q2(x). 

(6)  It  is  in  no  way  presupposed  in  the  definition  of  a  rational  function  that 
the  constants  which  occur  as  coefficients  should  be  rational  numbers.  The  word 
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rational  has  reference  solely  to  the  way  in  which  the  variable  x  appears  in  the 
function.  Thus 

x2  +  x  +  \/3 
x\/2  —  7 r 

is  a  rational  function. 

The  use  of  the  word  rational  arises  as  follows.  The  rational  function 
P(x)/Q(x)  may  be  generated  from  x  by  a  finite  number  of  operations  upon  x, 
including  only  multiplication  of  x  by  itself  or  a  constant,  addition  of  terms  thus 
obtained  and  division  of  one  function,  obtained  by  such  multiplications  and 
additions,  by  another.  In  so  far  as  the  variable  x  is  concerned,  this  procedure  is 
very  much  like  that  by  which  all  rational  numbers  can  be  obtained  from  unity, 
a  procedure  exemplified  in  the  equation 

5  _  l  +  l  +  l  +  l  +  l 

3 “  1+1+1 

Again,  any  function  which  can  be  deduced  from  x  by  the  elementary  oper¬ 
ations  mentioned  above  using  at  each  stage  of  the  process  functions  which  have 
already  been  obtained  from  x  in  the  same  way,  can  be  reduced  to  the  stan¬ 
dard  type  of  rational  function.  The  most  general  kind  of  function  which  can  be 
obtained  in  this  way  is  sufficiently  illustrated  by  the  example 

x  2x  +  7 

x2  +  1  +  „  11®  -  3y/2 

X  +  te  +  l 

which  can  obviously  be  reduced  to  the  standard  type  of  rational  function. 

25.  The  drawing  of  graphs  of  rational  functions,  even  more  than  that 
of  polynomials,  is  immensely  facilitated  by  the  use  of  methods  depend¬ 
ing  upon  the  differential  calculus.  We  shall  therefore  content  ourselves  at 
present  with  a  very  few  examples. 

Examples  XII.  1.  Draw  the  graphs  of  y  =  1/x,  y  =  1/x2,  y  =  1/x3,  ... . 

[The  figures  show  the  graphs  of  the  first  two  curves.  It  should  be  observed 
that  since  1/0,  I/O2,  ...  are  meaningless  expressions,  these  functions  are  not 
defined  for  x  =  ().] 
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y  =  l/x2 


Fig.  12. 


2.  Trace  y  =  x  +  (l/x),  x  —  ( l/x ),  x2  +  (l/x2),  x2  —  (l/x2)  and  ax  +  ( b/x ) 
taking  various  values,  positive  and  negative,  for  a  and  b. 

3.  Trace 

x  T  1  /  x  +  1  \  2  1  x2  T  1 

^  x  — 1’  \x  —  l)  ’  (x  —  l)2’  x2  — 1' 

4.  Trace  y  =  l/(x  —  a)(x  —  b),  l/(x  —  a)(x  —  b)(x  —  c),  where  a  <  b  <  c. 

5.  Sketch  the  general  form  assumed  by  the  curves  y  =  l/xm  as  m  becomes 
larger  and  larger,  considering  separately  the  cases  in  which  m  is  odd  or  even. 

26.  C.  Explicit  Algebraical  Functions.  The  next  important 
class  of  functions  is  that  of  explicit  algebraical  functions.  These  are  func¬ 
tions  which  can  be  generated  from  x  by  a  finite  number  of  operations  such 
as  those  used  in  generating  rational  functions,  together  with  a  finite  num¬ 
ber  of  operations  of  root  extraction.  Thus 

2 

\/l  +  x  -  \/l  -  x  (x2  +  x  +  V3\  ‘i 

7r=T?r=,  vW*  +  V5,  (^f —) 

are  explicit  algebraical  functions,  and  so  is  xm^n  (i.e.  \fx ™),  where  m  and  n 
are  any  integers. 
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It  should  be  noticed  that  there  is  an  ambiguity  of  notation  involved 
in  such  an  equation  as  y  —  yfx.  We  have,  up  to  the  present,  regarded 
( e.g .)  y/2  as  denoting  the  positive  square  root  of  2,  and  it  would  be  natural 
to  denote  by  a/x,  where  x  is  any  positive  number,  the  positive  square 
root  of  x,  in  which  case  y  =  yfx  would  be  a  one-valued  function  of  x.  It  is 
however  often  more  convenient  to  regard  a Jx  as  standing  for  the  two- valued 
function  whose  two  values  are  the  positive  and  negative  square  roots  of  x. 

The  reader  will  observe  that,  when  this  course  is  adopted,  the  func¬ 
tion  yfx  differs  fundamentally  from  rational  functions  in  two  respects.  In 
the  first  place  a  rational  function  is  always  defined  for  all  values  of  x  with 
a  certain  number  of  isolated  exceptions.  But  yfx  is  undefined  for  a  whole 
range  of  values  of  x  (i.e.  all  negative  values).  Secondly  the  function,  when 
x  has  a  value  for  which  it  is  defined,  has  generally  two  values  of  opposite 
signs. 

The  function  yfx,  on  the  other  hand,  is  one-valued  and  defined  for  all 
values  of  x. 

Examples  XIII.  1.  y/(x  —  a)(b  —  x ),  where  a  <  b,  is  defined  only  for 
a  ^  x  S  b.  If  a  <  x  <  b  it  has  two  values:  if  x  =  a  or  b  only  one,  viz.  0. 

2.  Consider  similarly 

a/(x  —  a)(x  —  b)(x  —  c)  (a  <  b  <  c), 
a/x(x2  —  a2),  \/(x  —  a)2(b  —  x)  (a  <b), 

y/1  +  X  -  \/\  —  X 
\/l  +  X  +  \/l  —  x' 

3.  Trace  the  curves  y 2  =  x,  y3  =  x,  y 2  =  x3. 

4.  Draw  the  graphs  of  the  functions 

y  =  V a2  —  x2,  y  =  by/l  —  ( x2/a 2). 

27.  D.  Implicit  Algebraical  Functions.  It  is  easy  to  verify  that 
if 

_  y/1  +  X  -  \/l  -  X 

y  \/T ~+x  +  v'l  —  x’ 


X  +  a/ X. 
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then 

(1+vY- 

(1  +  x)3 

VI- y) 

(1  —  x)2 

or  if 

/ - 

y=Vx+\ 

lx  +  y/x, 

then 

y 4  —  (Ay2  +  Ay  +  l)x  =  0. 


Each  of  these  equations  may  be  expressed  in  the  form 

ym  +  Riym~'  +  •  •  •  +  Rm  =  0,  (1) 

where  R.i,  i?2!  •  ••,  Rm  are  rational  functions  of  x:  and  the  reader  will 
easily  verify  that,  if  y  is  any  one  of  the  functions  considered  in  the  last  set 
of  examples,  y  satisfies  an  equation  of  this  form.  It  is  naturally  suggested 
that  the  same  is  true  of  any  explicit  algebraic  function.  And  this  is  in  fact 
true,  and  indeed  not  difficult  to  prove,  though  we  shall  not  delay  to  write 
out  a  formal  proof  here.  An  example  should  make  clear  to  the  reader  the 
lines  on  which  such  a  proof  would  proceed.  Let 

x  +  \fx  +  \J  x  +  y/x  +  \7l  +  x 
X  —  yfx  +  \fx~+  \fx  —  y/l  +  X 

Then  we  have  the  equations 

x  +  u  +  v  +  w 

y  = - - - , 

x  —  u  +  v  —  w 

u2  =  x,  v2  =  x  +  u,  w3  =  1  +  X, 

and  we  have  only  to  eliminate  u,  v,  w  between  these  equations  in  order  to 
obtain  an  equation  of  the  form  desired. 

We  are  therefore  led  to  give  the  following  definition:  a  function  y  =  f(x) 
will  be  said  to  be  an  algebraical  function  of  x  if  it  is  the  root  of  an  equation 
such  as  (1),  i.e.  the  root  of  an  equation  of  the  rnth  degree  in  y,  whose 
coefficients  are  rational  functions  of  x.  There  is  plainly  no  loss  of  generality 
in  supposing  the  first  coefficient  to  be  unity. 
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This  class  of  functions  includes  all  the  explicit  algebraical  functions 
considered  in  §  26.  But  it  also  includes  other  functions  which  cannot  be 
expressed  as  explicit  algebraical  functions.  For  it  is  known  that  in  general 
such  an  equation  as  (1)  cannot  be  solved  explicitly  for  y  in  terms  of  x,  when 
m  is  greater  than  4,  though  such  a  solution  is  always  possible  if  m  =  1, 
2,  3,  or  4  and  in  special  cases  for  higher  values  of  m. 

The  definition  of  an  algebraical  function  should  be  compared  with  that 
of  an  algebraical  number  given  in  the  last  chapter  (Misc.  Exs.  32). 

Examples  XIV.  1.  If  m  =  1,  y  is  a  rational  function. 

2.  If  m  =  2,  the  equation  is  y 2  +  R\y  +  R2  =  0,  so  that 

v  =  \{-Ri  ±  \JrI  -  4i?2}. 

This  function  is  defined  for  all  values  of  x  for  which  R2  [k  4i?2.  It  has  two  values 
if  R2  >  4i?2  and  one  if  R2  =  4i?2- 

If  m  =  3  or  4,  we  can  use  the  methods  explained  in  treatises  on  Algebra 
for  the  solution  of  cubic  and  biquadratic  equations.  But  as  a  rule  the  process 
is  complicated  and  the  results  inconvenient  in  form,  and  we  can  generally  study 
the  properties  of  the  function  better  by  means  of  the  original  equation. 

3.  Consider  the  functions  defined  by  the  equations 

y2  -  2y  -  x2  =  0,  y2  -  2y  +  x2  =  0,  y 4  -  2 y2  +  x2  =  0, 

in  each  case  obtaining  y  as  an  explicit  function  of  x,  and  stating  for  what  values 
of  x  it  is  defined. 

4.  Find  algebraical  equations,  with  coefficients  rational  in  x,  satisfied  by 
each  of  the  functions 


5.  Consider  the  equation  yA  =  x2. 

[Here  y2  =  ±x.  If  x  is  positive,  y  =  yj. x :  if  negative,  y  =  \f—x.  Thus  the 
function  has  two  values  for  all  values  of  x  save  x  =  0.] 

6.  An  algebraical  function  of  an  algebraical  function  of  x  is  itself  an  alge¬ 
braical  function  of  x. 


[II :  28] 


FUNCTIONS  OF  REAL  VARIABLES 


60 


[For  we  have 


ym  +  Ri(z)ym  1  +  . . .  +  Rm(z)  —  0, 

where 

z11  +  S\{x)zn  1  +  . . .  +  Sn{x )  =  0. 

Eliminating  z  we  find  an  equation  of  the  form 

if +  Tx(x)yp-X  +...  +  Tp(x)  =0. 

Here  all  the  capital  letters  denote  rational  functions.] 

7.  An  example  should  perhaps  be  given  of  an  algebraical  function  which 
cannot  be  expressed  in  an  explicit  algebraical  form.  Such  an  example  is  the 
function  y  defined  by  the  equation 

y5  —  y  —  x  =  0. 

But  the  proof  that  we  cannot  find  an  explicit  algebraical  expression  for  y  in 
terms  of  x  is  difficult,  and  cannot  be  attempted  here. 

28.  Transcendental  functions.  All  functions  of  x  which  are  not 

rational  or  even  algebraical  are  called  transcendental  functions.  This  class 
of  functions,  being  defined  in  so  purely  negative  a  manner,  naturally  in¬ 
cludes  an  infinite  variety  of  whole  kinds  of  functions  of  varying  degrees  of 
simplicity  and  importance.  Among  these  we  can  at  present  distinguish  two 
kinds  which  are  particularly  interesting. 

E.  The  direct  and  inverse  trigonometrical  or  circular  func¬ 
tions.  These  are  the  sine  and  cosine  functions  of  elementary  trigonometry, 
and  their  inverses,  and  the  functions  derived  from  them.  We  may  assume 
provisionally  that  the  reader  is  familiar  with  their  most  important  proper¬ 
ties.* 

*The  definitions  of  the  circular  functions  given  in  elementary  trigonometry  presup¬ 
pose  that  any  sector  of  a  circle  has  associated  with  it  a  definite  number  called  its  area. 
How  this  assumption  is  justified  will  appear  in  Ch.  VII. 
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Examples  XV.  1.  Draw  the  graphs  of  cosx,  sin  a;,  and  a  cosx +  6  sinx. 

[Since  acosx  +  6sinx  =  /?cos(x  — a),  where  /3  =  y/a2  +  b2,  and  a  is  an  angle 
whose  cosine  and  sine  are  a/  y/a2  +  b2  and  b/  y/ a2  +  b2 ,  the  graphs  of  these  three 
functions  are  similar  in  character.] 

2.  Draw  the  graphs  of  cos2  x.  sin2  x.  a  cos2  x  +  6  sin2  x. 

3.  Suppose  the  graphs  of  /(x)  and  F(x)  drawn.  Then  the  graph  of 

/(x)  cos2  x  +  F(x)  sin2  x 

is  a  wavy  curve  which  oscillates  between  the  curves  y  =  /(x),  y  =  F(x).  Draw 
the  graph  when  /(x)  =  x,  F(x)  =  x2. 

4.  Show  that  the  graph  of  cospx+cos  qx  lies  between  those  of  2  cos  \{p—q)x 

and  —2  cos  \  (p+q)x,  touching  each  in  turn.  Sketch  the  graph  when  ( p—q ) / ( p+q ) 
is  small.  (Math.  Trip.  1908.) 

5.  Draw  the  graphs  of  x  +  sinx,  (1/x)  +  sinx,  xsinx,  (sinx)/x. 

6.  Draw  the  graph  of  sin(l/x). 

[If  y  =  sin(l/x),  then  y  =  0  when  x  =  l/rmr ,  where  m  is  any  integer. 
Similarly  y  =  1  when  x  =  1/(2 m  +  |)7r  and  y  =  —1  when  x  =  1/(2 m  —  \)k. 
The  curve  is  entirely  comprised  between  the  lines  y  =  —  1  and  y  =  1  (Fig.  13). 
It  oscillates  up  and  down,  the  rapidity  of  the  oscillations  becoming  greater  and 
greater  as  x  approaches  0.  For  x  =  0  the  function  is  undefined.  When  x  is  large 
y  is  small.*  The  negative  half  of  the  curve  is  similar  in  character  to  the  positive 
half.] 

7.  Draw  the  graph  of  x  sin(l/x). 

[This  curve  is  comprised  between  the  lines  y  =  — x  and  y  =  x  just  as  the  last 
curve  is  comprised  between  the  lines  y  =  —  1  and  y  =  1  (Fig.  14).] 

8.  Draw  the  graphs  of  x2  sin(l/x),  (1/x)  sin(l/x),  sin2(l/x),  (xsin(l/x)}2, 
a  cos2  (1/x)  +  b  sin2  (1/x),  sinx  +  sin(l/x),  sinxsin(l/x). 

9.  Draw  the  graphs  of  cosx2,  sinx2,  a cosx2  +  b sinx2. 

10.  Draw  the  graphs  of  arc  cosx  and  arc  sinx. 

[If  y  =  arc  cos  x,  x  =  cos  y .  This  enables  us  to  draw  the  graph  of  x,  considered 
as  a  function  of  y,  and  the  same  curve  shows  y  as  a  function  of  x.  It  is  clear 
that  y  is  only  defined  for  —  1  ^  x  ^  1,  and  is  infinitely  many- valued  for  these 
values  of  x.  As  the  reader  no  doubt  remembers,  there  is,  when  —1  <  x  <  1,  a 


See  Chs.  IV  and  V  for  explanations  as  to  the  precise  meaning  of  this  phrase. 
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Fig.  13. 


Fig.  14. 


value  of  y  between  0  and  tt,  say  a,  and  the  other  values  of  y  are  given  by  the 
formula  2mr  ±  a,  where  n  is  any  integer,  positive  or  negative.] 

11.  Draw  the  graphs  of 

tanx,  cotx,  secx,  cosec  x,  tan2x,  cot2  x,  sec2  x,  cosec2  x. 

12.  Draw  the  graphs  of  arc  tan  x,  arc  cotx,  arc  sec  x,  arc  cosec  x.  Give  for¬ 
mulae  (as  in  Ex.  10)  expressing  all  the  values  of  each  of  these  functions  in  terms 
of  any  particular  value. 

13.  Draw  the  graphs  of  tan(l/x),  cot(l/x),  sec(l/x),  cosec(l/x). 

14.  Show  that  cosx  and  sinx  are  not  rational  functions  of  x. 

[A  function  is  said  to  be  periodic ,  with  period  a,  if  /(x)  =  f(x  +  a)  for  all 
values  of  x  for  which  /(x)  is  defined.  Thus  cosx  and  sinx  have  the  period  27t. 
It  is  easy  to  see  that  no  periodic  function  can  be  a  rational  function,  unless  it  is 
a  constant.  For  suppose  that 

/(x)  =  P{x)/Q[x), 

where  P  and  Q  are  polynomials,  and  that  /(x)  =  /(x+a),  each  of  these  equations 
holding  for  all  values  of  x.  Let  /( 0)  =  k.  Then  the  equation  P(x)  —  kQ(x )  =  0 
is  satisfied  by  an  infinite  number  of  values  of  x,  viz.  x  =  0,  a,  2a,  etc.,  and 
therefore  for  all  values  of  x.  Thus  /(x)  =  k  for  all  values  of  x,  i.e.  f(x)  is  a 
constant.] 

15.  Show,  more  generally,  that  no  function  with  a  period  can  be  an  alge¬ 
braical  function  of  x. 
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[Let  the  equation  which  defines  the  algebraical  function  be 

ym  +  Riym  1  +  ■  •  •  +  Rm  =  0  (1) 

where  R\,  ...  are  rational  functions  of  x.  This  may  be  put  in  the  form 

P0ym  +  Piy171-1  +  •  •  •  +  Pm  =  0, 

where  Po,  Pi,  •  •  •  are  polynomials  in  x.  Arguing  as  above,  we  see  that 

P0km  +  P ifc™-1  +  •  •  •  +  Pm  =  0 

for  all  values  of  x.  Hence  y  =  k  satisfies  the  equation  (1)  for  all  values  of  x,  and 
one  set  of  values  of  our  algebraical  function  reduces  to  a  constant. 

Now  divide  (1)  by  y  —  k  and  repeat  the  argument.  Our  final  conclusion 
is  that  our  algebraical  function  has,  for  any  value  of  x,  the  same  set  of  values 
k,  k' ,  . . . ;  i.e.  it  is  composed  of  a  certain  number  of  constants.] 

16.  The  inverse  sine  and  inverse  cosine  are  not  rational  or  algebraical  func¬ 
tions.  [This  follows  from  the  fact  that,  for  any  value  of  x  between  —1  and  +1, 
arc  sin  x  and  arc  cos  x  have  infinitely  many  values.] 

29.  F.  Other  classes  of  transcendental  functions.  Next  in 
importance  to  the  trigonometrical  functions  come  the  exponential  and  log¬ 
arithmic  functions,  which  will  be  discussed  in  Chs.  IX  and  X.  But  these 
functions  are  beyond  our  range  at  present.  And  most  of  the  other  classes  of 
transcendental  functions  whose  properties  have  been  studied,  such  as  the  el¬ 
liptic  functions,  Bessel’s  and  Legendre’s  functions,  Gamma- functions,  and 
so  forth,  lie  altogether  beyond  the  scope  of  this  book.  There  are  however 
some  elementary  types  of  functions  which,  though  of  much  less  importance 
theoretically  than  the  rational,  algebraical,  or  trigonometrical  functions, 
are  particularly  instructive  as  illustrations  of  the  possible  varieties  of  the 
functional  relation. 

Examples  XVI.  1.  Let  y  =  [x],  where  [x\  denotes  the  greatest  integer 
not  greater  than  x.  The  graph  is  shown  in  Fig.  15a.  The  left-hand  end  points 
of  the  thick  lines,  but  not  the  right-hand  ones,  belong  to  the  graph. 

2.  y  =  x  —  [x].  (Fig.  15b.) 
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0  1  2 

Fig.  15a. 


Fig.  15b. 


3.  y  =  y/x  —  [x] .  (Fig.  15c.) 

4.  y  =  [x]  +  y/x  —  [x] .  (Fig.  15d.) 

5.  y  =  {x—  [x])2,  [x\  +  (x—  [x])2. 

6.  y  =  [v/i],  [x2],  yfx  -  [y/x\,  x2  - 


Fig.  15c. 


[X2],  [l-*2]. 


7.  Let  y  be  defined  as  the  largest  prime  factor  of  x  (cf.  Exs.  X.  6).  Then 
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y  is  defined  only  for  integral  values  of  x.  If 

x  =  1,  2,  3,  4,  5,  6,  7,  8,  9,  10,  11,  12,  13, 

then 

y  =  1,  2,  3,  2,  5,  3,  7,  2,  3,  5,  11,  3,  13,  .... 

The  graph  consists  of  a  number  of  isolated  points. 

8.  Let  y  be  the  denominator  of  x  (Exs.  X.  7).  In  this  case  y  is  defined  only 
for  rational  values  of  x.  We  can  mark  off  as  many  points  on  the  graph  as  we 
please,  but  the  result  is  not  in  any  ordinary  sense  of  the  word  a  curve,  and  there 
are  no  points  corresponding  to  any  irrational  values  of  x. 

Draw  the  straight  line  joining  the  points  ( N  —  1  ,N),  (N,N),  where  ?V  is  a 
positive  integer.  Show  that  the  number  of  points  of  the  locus  which  lie  on  this 
line  is  equal  to  the  number  of  positive  integers  less  than  and  prime  to  N. 

9.  Let  y  =  0  when  x  is  an  integer,  y  =  x  when  x  is  not  an  integer.  The 
graph  is  derived  from  the  straight  line  y  =  x  by  taking  out  the  points 

...  (-1,-1),  (0,0),  (1,1),  (2,2),  ... 

and  adding  the  points  (—1,0),  (0,0),  (1,0),  ...  on  the  axis  of  x. 

The  reader  may  possibly  regard  this  as  an  unreasonable  function.  Why,  he 
may  ask,  if  y  is  equal  to  x  for  all  values  of  x  save  integral  values,  should  it  not 
be  equal  to  x  for  integral  values  too?  The  answer  is  simply,  why  should  it ?  The 
function  y  does  in  point  of  fact  answer  to  the  definition  of  a  function:  there 
is  a  relation  between  x  and  y  such  that  when  x  is  known  y  is  known.  We  are 
perfectly  at  liberty  to  take  this  relation  to  be  what  we  please,  however  arbitrary 
and  apparently  futile.  This  function  y  is,  of  course,  a  quite  different  function 
from  that  one  which  is  always  equal  to  x,  whatever  value,  integral  or  otherwise, 
x  may  have. 

10.  Let  y  =  1  when  x  is  rational,  but  y  =  0  when  x  is  irrational.  The  graph 
consists  of  two  series  of  points  arranged  upon  the  lines  y  =  1  and  y  =  0.  To  the 
eye  it  is  not  distinguishable  from  two  continuous  straight  lines,  but  in  reality  an 
infinite  number  of  points  are  missing  from  each  line. 

11.  Let  y  =  x  when  x  is  irrational  and  y  =  y?(l  +  p2)/(  1  +  q 2)  when  x  is  a 
rational  fraction  p/q. 
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Fig.  16. 


The  irrational  values  of  x  contribute  to  the  graph  a  curve  in  reality  discon¬ 
tinuous,  but  apparently  not  to  be  distinguished  from  the  straight  line  y  =  x. 

Now  consider  the  rational  values  of  x.  First  let  x  be  positive.  Then 
^/(l  +p2)/(  1  +  q 2)  cannot  be  equal  to  p/q  unless  p  =  q,  i.e.  x  =  1.  Thus 
all  the  points  which  correspond  to  rational  values  of  x  lie  off  the  line,  ex¬ 
cept  the  one  point  (1, 1).  Again,  if  p  <  q,  y/(l  +  p2)/(  1  +  q 2)  >  p/q;  if  p  >  q, 
y^(l  +p2)/(  1  +  q 2)  <  p/q.  Thus  the  points  lie  above  the  line  y  =  xifO<x<l, 
below  if  x  >  1.  If  p  and  q  are  large,  yj(\  +p2)/{  1  +  q2)  is  nearly  equal  to  p/q. 
Near  any  value  of  x  we  can  find  any  number  of  rational  fractions  with  large 
numerators  and  denominators.  Hence  the  graph  contains  a  large  number  of 
points  which  crowd  round  the  line  y  =  x.  Its  general  appearance  (for  positive 
values  of  x)  is  that  of  a  line  surrounded  by  a  swarm  of  isolated  points  which 
gets  denser  and  denser  as  the  points  approach  the  line. 

The  part  of  the  graph  which  corresponds  to  negative  values  of  x  consists 
of  the  rest  of  the  discontinuous  line  together  with  the  reflections  of  all  these 
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isolated  points  in  the  axis  of  y.  Thus  to  the  left  of  the  axis  of  y  the  swarm  of 
points  is  not  round  y  =  x  but  round  y  =  —x,  which  is  not  itself  part  of  the 
graph.  See  Fig.  16. 

30.  Graphical  solution  of  equations  containing  a  single  un¬ 
known  number.  Many  equations  can  be  expressed  in  the  form 

f(x)  =  00),  (1) 

where  f(x )  and  0(x)  are  functions  whose  graphs  are  easy  to  draw.  And  if 
the  curves 

V  =  f  0),  V  =  </>0) 

intersect  in  a  point  P  whose  abscissa  is  £,  then  £  is  a  root  of  the  equa¬ 
tion  (1). 

Examples  XVII.  1.  The  quadratic  equation  ax2  +  2 bx  +  c  =  0. 

This  may  be  solved  graphically  in  a  variety  of  ways.  For  instance  we  may  draw 
the  graphs  of 

y  =  ax  +  2b,  y  =  -c/x, 

whose  intersections,  if  any,  give  the  roots.  Or  we  may  take 

y  =  x2,  y  =  —  (2bx  +  c)/a. 

But  the  most  elementary  method  is  probably  to  draw  the  circle 

a(x2  +  y2)  +  2  bx  +  c  =  0, 

whose  centre  is  (— b/a,  0)  and  radius  {Vb2  —  ac}/a.  The  abscissae  of  its  inter¬ 
sections  with  the  axis  of  x  are  the  roots  of  the  equation. 

2.  Solve  by  any  of  these  methods 

x2  +  2x  —  3  =  0,  x2  —  lx  +  4  =  0,  3x2  +  2x  —  2  =  0. 

3.  The  equation  xm  +  ax  +  b  =  0.  This  may  be  solved  by  constructing 
the  curves  y  =  xm,  y  =  — ax  —  b.  Verify  the  following  table  for  the  number  of 
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roots  of 


xm  +  ax  +  b  =  0  : 


(6) rn  odd 


( a)m  even 


b  positive,  two  or  none , 
b  negative,  two ; 

a  positive,  one, 
a  negative,  three  or  one. 


Construct  numerical  examples  to  illustrate  all  possible  cases. 

4.  Show  that  the  equation  tan  x  =  ax  +  b  has  always  an  infinite  number  of 
roots. 

5.  Determine  the  number  of  roots  of 


6.  Show  that  if  a  is  small  and  positive  ( e.g .  a  =  .01),  the  equation 


has  three  roots.  Consider  also  the  case  in  which  a  is  small  and  negative.  Explain 
how  the  number  of  roots  varies  as  a  varies. 

31.  Functions  of  two  variables  and  their  graphical  represen¬ 
tation.  In  §  20  we  considered  two  variables  connected  by  a  relation.  We 
may  similarly  consider  three  variables  (x,  y ,  and  z)  connected  by  a  rela¬ 
tion  such  that  when  the  values  of  x  and  y  are  both  given,  the  value  or 
values  of  z  are  known.  In  this  case  we  call  z  a  function  of  the  two  variables 
x  and  y,  x  and  y  the  independent  variables,  z  the  dependent  variable;  and 
we  express  this  dependence  of  z  upon  x  and  y  by  writing 


z  =  f(x,y). 


The  remarks  of  §  20  may  all  be  applied,  mutatis  mutandis ,  to  this  more 
complicated  case. 

The  method  of  representing  such  functions  of  two  variables  graphically 
is  exactly  the  same  in  principle  as  in  the  case  of  functions  of  a  single  vari¬ 
able.  We  must  take  three  axes,  OX,  OY ,  OZ  in  space  of  three  dimensions, 
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each  axis  being  perpendicular  to  the  other  two.  The  point  (a,  b,  c )  is  the 
point  whose  distances  from  the  planes  YOZ,  ZOX,  XOY,  measured  par¬ 
allel  to  OX,  OY ,  OZ ,  are  a,  b,  and  c.  Regard  must  of  course  be  paid  to 
sign,  lengths  measured  in  the  directions  OX,  OY,  OZ  being  regarded  as 
positive.  The  definitions  of  coordinates,  axes,  origin  are  the  same  as  before. 
Now  let 

*  =  f(x,  y)- 

As  x  and  y  vary,  the  point  (x,  y,  z )  will  move  in  space.  The  aggregate  of 
all  the  positions  it  assumes  is  called  the  locus  of  the  point  (x,  y,  z )  or  the 
graph  of  the  function  z  =  f(x,y).  When  the  relation  between  x,  y,  and  z 
which  defines  z  can  be  expressed  in  an  analytical  formula,  this  formula  is 
called  the  equation  of  the  locus.  It  is  easy  to  show,  for  example,  that  the 
equation 

Ax  +  By  +  Cz  +  D  =  0 

( the  general  equation  of  the  first  degree )  represents  a  plane,  and  that  the 
equation  of  any  plane  is  of  this  form.  The  equation 

(x  ~  «)2  +  (y  -  fi)2  +  (z  -  7)2  =  P2, 

or 

x2  +  y2  +  z2  +  2  Fx  +  2  Gy  +  2  Hz  +  C  =  0, 

where  F2  +  G2  +  H2  —  C  >  0,  represents  a  sphere ;  and  so  on.  For  proofs 
of  these  propositions  we  must  again  refer  to  text-books  of  Analytical  Ge¬ 
ometry. 

32.  Curves  in  a  plane.  We  have  hitherto  used  the  notation 

y  =  }{x)  (i) 

to  express  functional  dependence  of  y  upon  x.  It  is  evident  that  this  no¬ 
tation  is  most  appropriate  in  the  case  in  which  y  is  expressed  explicitly  in 
terms  of  x  by  means  of  a  formula,  as  when  for  example 


V  =  x2, 


sin  x,  a  cos2  x  +  b  sin2  x. 
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We  have  however  very  often  to  deal  with  functional  relations  which 
it  is  impossible  or  inconvenient  to  express  in  this  form.  If.  for  example, 
y5  —  y  —  x  =  0  or  x5  +  y5  —  ay  =  0,  it  is  known  to  be  impossible  to  express  y 
explicitly  as  an  algebraical  function  of  x.  If 

x2  +  y2  +  2  Gx  +  2  Fy  +  C  =  0, 
y  can  indeed  be  so  expressed,  viz.  by  the  formula 

y  =  -F+  VF2  -x2-  2 Gx  -  C\ 

but  the  functional  dependence  of  y  upon  x  is  better  and  more  simply 
expressed  by  the  original  equation. 

It  will  be  observed  that  in  these  two  cases  the  functional  relation  is 
fully  expressed  by  equating  a  function  of  the  two  variables  x  and  y  to  zero, 
i.e.  by  means  of  an  equation 


f(x,y)  =  o.  (2) 

We  shall  adopt  this  equation  as  the  standard  method  of  expressing  the 
functional  relation.  It  includes  the  equation  (1)  as  a  special  case,  since 
y  —  f(pc)  is  a  special  form  of  a  function  of  x  and  y.  We  can  then  speak 
of  the  locus  of  the  point  (x,y)  subject  to  f(x,y)  =  0,  the  graph  of  the 
function  y  defined  by  f(x,y)  =  0,  the  curve  or  locus  f(x,y)  =  0,  and  the 
equation  of  this  curve  or  locus. 

There  is  another  method  of  representing  curves  which  is  often  useful. 
Suppose  that  x  and  y  are  both  functions  of  a  third  variable  t,  which  is  to  be 
regarded  as  essentially  auxiliary  and  devoid  of  any  particular  geometrical 
significance.  We  may  write 

x  =  f(t ),  y  =  F(t).  (3) 

If  a  particular  value  is  assigned  to  t,  the  corresponding  values  of  x  and 
of  y  are  known.  Each  pair  of  such  values  defines  a  point  (x,y).  If  we 
construct  all  the  points  which  correspond  in  this  way  to  different  values 
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of  t,  we  obtain  the  graph  of  the  locus  defined  by  the  equations  (3).  Suppose 
for  example 

x  =  a  cos  t,  y  =  a  sin  t. 

Let  t  vary  from  0  to  2tc.  Then  it  is  easy  to  see  that  the  point  (x,  y)  describes 
the  circle  whose  centre  is  the  origin  and  whose  radius  is  a.  If  t  varies  beyond 
these  limits,  (x,  y)  describes  the  circle  over  and  over  again.  We  can  in  this 
case  at  once  obtain  a  direct  relation  between  x  and  y  by  squaring  and 
adding:  we  find  that  x2  +  y2  =  a2,  t  being  now  eliminated. 

Examples  XVIII.  1.  The  points  of  intersection  of  the  two  curves  whose 
equations  are  f(x,y)  =  0,  f(x,y)  =  0,  where  /  and  f  are  polynomials,  can  be 
determined  if  these  equations  can  be  solved  as  a  pair  of  simultaneous  equations 
in  x  and  y.  The  solution  generally  consists  of  a  finite  number  of  pairs  of  values 
of  x  and  y.  The  two  equations  therefore  generally  represent  a  finite  number  of 
isolated  points. 

2.  Trace  the  curves  (x  +  y)2  =  1,  xy  =  1,  x2  —  y2  =  1. 

3.  The  curve  f(x,  y)  +  X(j)(x,  y)  =  0  represents  a  curve  passing  through  the 
points  of  intersection  of  /  =  0  and  <f>  =  0. 

4.  What  loci  are  represented  by 

(cc)  x  =  at  +  6,  y  =  ct  +  d,  (/3)  x/a  =  2f/(l  +  t2),  y/a  =  (1  —  t2)/(l  + 12), 
when  t  varies  through  all  real  values? 

33.  Loci  in  space.  In  space  of  three  dimensions  there  are  two  fun¬ 
damentally  different  kinds  of  loci,  of  which  the  simplest  examples  are  the 
plane  and  the  straight  line. 

A  particle  which  moves  along  a  straight  line  has  only  one  degree  of 
freedom.  Its  direction  of  motion  is  fixed;  its  position  can  be  completely 
fixed  by  one  measurement  of  position,  e.g.  by  its  distance  from  a  fixed 
point  on  the  line.  If  we  take  the  line  as  our  fundamental  line  A  of  Chap.  I, 
the  position  of  any  of  its  points  is  determined  by  a  single  coordinate  x. 
A  particle  which  moves  in  a  plane,  on  the  other  hand,  has  two  degrees 
of  freedom;  its  position  can  only  be  fixed  by  the  determination  of  two 
coordinates. 
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A  locus  represented  by  a  single  equation 

z  =  f(x,y) 

plainly  belongs  to  the  second  of  these  two  classes  of  loci,  and  is  called  a 
surface.  It  may  or  may  not  (in  the  obvious  simple  cases  it  will)  satisfy  our 
common-sense  notion  of  what  a  surface  should  be. 

The  considerations  of  §  31  may  evidently  be  generalised  so  as  to  give 
definitions  of  a  function  f(x,y,z)  of  three  variables  (or  of  functions  of  any 
number  of  variables).  And  as  in  §  32  we  agreed  to  adopt  f(x,  y)  —  0  as  the 
standard  form  of  the  equation  of  a  plane  curve,  so  now  we  shall  agree  to 
adopt 

f  (x,y,z)  =  0 

as  the  standard  form  of  equation  of  a  surface. 

The  locus  represented  by  two  equations  of  the  form  z  =  f(x,y )  or 
f(x,y,z)  =  0  belongs  to  the  first  class  of  loci,  and  is  called  a  curve. 
Thus  a  straight  line  may  be  represented  by  two  equations  of  the  type 
Ax  +  By  +  Cz  +  D  =  0.  A  circle  in  space  may  be  regarded  as  the 
intersection  of  a  sphere  and  a  plane;  it  may  therefore  be  represented  by 
two  equations  of  the  forms 

(x  —  a)2  +  (y  —  /3)2  +  (z  —  y)2  =  p2,  Ax  +  By  +  Cz  +  D  =  0. 

Examples  XIX.  1.  What  is  represented  by  three  equations  of  the  type 
f(x,y,z )  =  0? 

2.  Three  linear  equations  in  general  represent  a  single  point.  What  are  the 
exceptional  cases? 

3.  What  are  the  equations  of  a  plane  curve  f(x,  y)  =  0  in  the  plane  XOY, 
when  regarded  as  a  curve  in  space?  [f(x,  y)  =  0,  z  =  0.] 

4.  Cylinders.  What  is  the  meaning  of  a  single  equation  f(x,  y )  =  0, 
considered  as  a  locus  in  space  of  three  dimensions? 

[All  points  on  the  surface  satisfy  f{x,  y)  =  0,  whatever  be  the  value  of  2.  The 
curve  f(x,y )  =  0,  z  =  0  is  the  curve  in  which  the  locus  cuts  the  plane  XOY. 
The  locus  is  the  surface  formed  by  drawing  lines  parallel  to  OZ  through  all 
points  of  this  curve.  Such  a  surface  is  called  a  cylinder .] 
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5.  Graphical  representation  of  a  surface  on  a  plane.  Contour 
Maps.  It  might  seem  to  be  impossible  to  represent  a  surface  adequately  by  a 
drawing  on  a  plane;  and  so  indeed  it  is:  but  a  very  fair  notion  of  the  nature  of 
the  surface  may  often  be  obtained  as  follows.  Let  the  equation  of  the  surface  be 

2  =  f{x,y)- 

If  we  give  z  a  particular  value  a,  we  have  an  equation  f(x,y )  =  a,  which 
we  may  regard  as  determining  a  plane  curve  on  the  paper.  We  trace  this  curve 
and  mark  it  (a).  Actually  the  curve  (a)  is  the  projection  on  the  plane  XOY 
of  the  section  of  the  surface  by  the  plane  z  =  a.  We  do  this  for  all  values  of  a 
(practically,  of  course,  for  a  selection  of  values  of  a).  We  obtain  some  such  figure 
as  is  shown  in  Fig.  17.  It  will  at  once  suggest  a  contoured  Ordnance  Survey 
map:  and  in  fact  this  is  the  principle  on  which  such  maps  are  constructed.  The 
contour  line  1000  is  the  projection,  on  the  plane  of  the  sea  level,  of  the  section 
of  the  surface  of  the  land  by  the  plane  parallel  to  the  plane  of  the  sea  level  and 
1000  ft.  above  it.* 


6.  Draw  a  series  of  contour  lines  to  illustrate  the  form  of  the  surface 
2  z  =  3  xy. 

7.  Right  circular  cones.  Take  the  origin  of  coordinates  at  the  vertex  of 


We  assume  that  the  effects  of  the  earth’s  curvature  may  be  neglected. 
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the  cone  and  the  axis  of  z  along  the  axis  of  the  cone;  and  let  a  be  the  semi¬ 
vertical  angle  of  the  cone.  The  equation  of  the  cone  (which  must  be  regarded  as 
extending  both  ways  from  its  vertex)  is  x2  +  y2  —  z2  tan2  a  =  0. 

8.  Surfaces  of  revolution  in  general.  The  cone  of  Ex.  7  cuts  ZOX 
in  two  lines  whose  equations  may  be  combined  in  the  equation  x 2  =  z 2  tan2  a. 
That  is  to  say,  the  equation  of  the  surface  generated  by  the  revolution  of  the 
curve  y  =  0,  x2  =  z2  tan2  a  round  the  axis  of  z  is  derived  from  the  second  of 
these  equations  by  changing  x2  into  x2  +  y2 .  Show  generally  that  the  equation 
of  the  surface  generated  by  the  revolution  of  the  curve  y  =  0,  x  =  f(z),  round 
the  axis  of  z,  is 

Vx2  +  y2  =  f(z). 

9.  Cones  in  general.  A  surface  formed  by  straight  lines  passing  through 
a  fixed  point  is  called  a  cone:  the  point  is  called  the  vertex.  A  particular  case 
is  given  by  the  right  circular  cone  of  Ex.  7.  Show  that  the  equation  of  a  cone 
whose  vertex  is  O  is  of  the  form  f(z/x,  z/y)  =  0,  and  that  any  equation  of  this 
form  represents  a  cone.  [If  ( x ,  y,  z)  lies  on  the  cone,  so  must  (Ax,  Ay,  Xz),  for  any 
value  of  A.] 

10.  Ruled  surfaces.  Cylinders  and  cones  are  special  cases  of  surfaces 
composed  of  straight  lines.  Such  surfaces  are  called  ruled  surfaces. 

The  two  equations 

x  =  az  +  b,  y  =  cz  +  d,  (1) 

represent  the  intersection  of  two  planes,  i.e.  a  straight  line.  Now  suppose  that 
a,  b,  c,  d  instead  of  being  fixed  are  functions  of  an  auxiliary  variable  t.  For  any 
particular  value  of  t  the  equations  (1)  give  a  line.  As  t  varies,  this  line  moves 
and  generates  a  surface,  whose  equation  may  be  found  by  eliminating  t  between 
the  two  equations  (1).  For  instance,  in  Ex.  7  the  equations  of  the  line  which 
generates  the  cone  are 


x  =  z  tan  a  cos  t,  y  =  z  tan  a  sin  t, 

where  t  is  the  angle  between  the  plane  XOZ  and  a  plane  through  the  line  and 
the  axis  of  z. 

Another  simple  example  of  a  ruled  surface  may  be  constructed  as  follows. 
Take  two  sections  of  a  right  circular  cylinder  perpendicular  to  the  axis  and  at 
a  distance  l  apart  (Fig.  18a).  We  can  imagine  the  surface  of  the  cylinder  to  be 


[II :  33] 


FUNCTIONS  OF  REAL  VARIABLES 


75 


made  up  of  a  number  of  thin  parallel  rigid  rods  of  length  l,  such  as  PQ,  the  ends 
of  the  rods  being  fastened  to  two  circular  rods  of  radius  a. 

Now  let  us  take  a  third  circular  rod  of  the  same  radius  and  place  it  round 
the  surface  of  the  cylinder  at  a  distance  h  from  one  of  the  first  two  rods  (see 
Fig.  18a,  where  Pq  =  h).  Unfasten  the  end  Q  of  the  rod  PQ  and  turn  PQ 
about  P  until  Q  can  be  fastened  to  the  third  circular  rod  in  the  position  Q' . 
The  angle  qOQ'  =  a  in  the  figure  is  evidently  given  by 

l2  —  h2  =  qQ'2  =  (2a  sin  ^ a )2  . 

Let  all  the  other  rods  of  which  the  cylinder  was  composed  be  treated  in  the  same 
way.  We  obtain  a  ruled  surface  whose  form  is  indicated  in  Fig.  18b.  It  is  entirely 
built  up  of  straight  lines;  but  the  surface  is  curved  everywhere,  and  is  in  general 
shape  not  unlike  certain  forms  of  table-napkin  rings  (Fig.  18c). 


MISCELLANEOUS  EXAMPLES  ON  CHAPTER  II. 

1.  Show  that  if  y  =  f(x)  =  (ax  +  b)/(cx  —  a)  then  x  =  f(y ). 

2.  If  f(x)  =  f(—x)  for  all  values  of  x,  f(x)  is  called  an  even  function. 
If  f(x)  =  —f(—x),  it  is  called  an  odd  function.  Show  that  any  function  of  x, 
defined  for  all  values  of  x,  is  the  sum  of  an  even  and  an  odd  function  of  x. 

[Use  the  identity  f(x)  =  \{f(x)  +  f(-x)}  +  \{f(x)  -  f(-x)}.} 
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3.  Draw  the  graphs  of  the  functions 


3  sin  x  +  4  cos  x ,  sin  —=  sin  x  . 

\y/2 


IT 


(Math.  Trip.  1896.) 


4.  Draw  the  graphs  of  the  functions 


sin  x(a  cos2  x  +  b  sin2  x),  - (a  cos2  x  +  b  sin2  x), 


x 


5.  Draw  the  graphs  of  the  functions  x[l/x],  [x]/x. 

6.  Draw  the  graphs  of  the  functions 

(i)  arccos(2x2  —  1)  —  2  arc  cos  x, 

....  a  +  x 

(u )  arc  tan - arc  tan  a  —  arc  tan  x, 

1  —  ax 

where  the  symbols  arc  cos  a,  arc  tan  a  denote,  for  any  value  of  a,  the  least  positive 
(or  zero)  angle,  whose  cosine  or  tangent  is  a. 

7.  Verify  the  following  method  of  constructing  the  graph  of  f{cj)(x)}  by 
means  of  the  line  y  =  x  and  the  graphs  of  /(x)  and  4>(x):  take  OA  =  x  along  OX , 
draw  AB  parallel  to  OY  to  meet  y  =  <j>(x)  in  B ,  BC  parallel  to  OX  to  meet 
y  =  x  in  C,  CD  parallel  to  OY  to  meet  y  =  f(x)  in  D,  and  DP  parallel  to  OX 
to  meet  AB  in  P;  then  P  is  a  point  on  the  graph  required. 

8.  Show  that  the  roots  of  x3  +px  +  q  =  0  are  the  abscissae  of  the  points  of 
intersection  (other  than  the  origin)  of  the  parabola  y  =  x2  and  the  circle 

x2  +y2  +  (p-  1  )y  +  qx  =  0. 


9.  The  roots  of  x4  +  nx3  +  px2  +  qx  +  r  =  0  are  the  abscissae  of  the  points 
of  intersection  of  the  parabola  x2  =  y  —  \nx  and  the  circle 


'  +  y2  +  (In2  -  \pn  +  \n  +  q)x  +  (p  -  1  -  \n2 


\  o  i  -L  r  —  D 


10.  Discuss  the  graphical  solution  of  the  equation 


xm  +  ax 2  +  hx  +  c  =  0 
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by  means  of  the  curves  y  =  xm,  y  =  — ax 2  —  bx  —  c.  Draw  up  a  table  of  the 
various  possible  numbers  of  roots. 

11.  Solve  the  equation  sec  6  +  cosec  6  =  2\/2;  and  show  that  the  equation 
sec  6  +  cosec  6  =  c  has  two  roots  between  0  and  27r  if  c2  <  8  and  four  if  c2  >8. 

12.  Show  that  the  equation 

2x  =  (2  n  +  l)7r(l  —  cosx), 


where  n  is  a  positive  integer,  has  2n  +  3  roots  and  no  more,  indicating  their 
localities  roughly.  (Math.  Trip.  1896.) 

13.  Show  that  the  equation  |x  sinx  =  1  has  four  roots  between  —  n  and  ir. 

14.  Discuss  the  number  and  values  of  the  roots  of  the  equations 

(1)  COt  X  +  X  —  |7T  =  0, 

(2)  x2  +  sin2  x  =  1, 

(3)  tanx  =  2x/(l  +  x2), 

(4)  sinx  —  x  +  |x3  =  0, 

(5)  (1  —  cosx)  tana  —  x  +  sinx  =  0. 

15.  The  polynomial  of  the  second  degree  which  assumes,  when  x  =  a,  b,  c 
the  values  a,  /3,  7  is 


a 


(x 

(a 


b)(x  —  c)  (x  —  c)(x  —  a)  (x  —  a)(x  —  b) 

b)(a  —  c)  (b  —  c)(b  —  a)  +  ^  (c  —  a)(c  —  b) 


Give  a  similar  formula  for  the  polynomial  of  the  (n—  l)th  degree  which  assumes, 
when  x  =  aq,  <22,  ...  an ,  the  values  ai,  02,  •  •  •  an. 

16.  Find  a  polynomial  in  x  of  the  second  degree  which  for  the  values  0,  1,  2 

of  x  takes  the  values  1/c,  l/(c  +  1),  l/(c  +  2);  and  show  that  when  x  =  c  +  2  its 
value  is  l/(c+  1).  (Math.  Trip.  1911.) 

17.  Show  that  if  x  is  a  rational  function  of  y,  and  y  is  a  rational  function 
of  x,  then  Axy  +  Bx  +  Cy  +  D  =  0. 

18.  If  y  is  an  algebraical  function  of  x,  then  x  is  an  algebraical  function  of  y. 

19.  Verify  that  the  equation 

1  1  * 

cos  jvrx  =1 - 

x  +  (x  —  1) 
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is  approximately  true  for  all  values  of  x  between  0  and  1.  [Take  x  =  0, 

|,  1,  and  use  tables.  For  which  of  these  values  is  the  formula  exact?] 

20.  What  is  the  form  of  the  graph  of  the  functions 

z=[x\  +  [y\,  z  =  x  +  y-[x\-[y]7 

21.  What  is  the  form  of  the  graph  of  the  functions  z  =  sinx  +  siny,  z  = 
sin x sin y,  z  =  sin xy,  z  =  sin(x2  +  y2)? 

22.  Geometrical  constructions  for  irrational  numbers.  In  Chapter  I 
we  indicated  one  or  two  simple  geometrical  constructions  for  a  length  equal 
to  \/2,  starting  from  a  given  unit  length.  We  also  showed  how  to  construct  the 
roots  of  any  quadratic  equation  ax2  +  2bx  +  c  =  0,  it  being  supposed  that  we  can 
construct  lines  whose  lengths  are  equal  to  any  of  the  ratios  of  the  coefficients 
a,  6,  c,  as  is  certainly  the  case  if  a,  b,  c  are  rational.  All  these  constructions 
were  what  may  be  called  Euclidean  constructions;  they  depended  on  the  ruler 
and  compasses  only. 

It  is  fairly  obvious  that  we  can  construct  by  these  methods  the  length  mea¬ 
sured  by  any  irrational  number  which  is  defined  by  any  combination  of  square 
roots,  however  complicated.  Thus 


N 


17  +  3VTT 
17-3\/TT 


17-  3VTl 
17  +  3v/TT 


is  a  case  in  point.  This  expression  contains  a  fourth  root,  but  this  is  of  course 
the  square  root  of  a  square  root.  We  should  begin  by  constructing  Vll,  e.g.  as 
the  mean  between  1  and  11:  then  17  +  3\/TT  and  17  —  3\/TT,  and  so  on.  Or  these 
two  mixed  surds  might  be  constructed  directly  as  the  roots  of  x2  —  34x  +  190  =  0. 

Conversely,  only  irrationals  of  this  kind  can  be  constructed  by  Euclidean 
methods.  Starting  from  a  unit  length  we  can  construct  any  rational  length. 
And  hence  we  can  construct  the  line  Ax  +  By  +  (7  =  0,  provided  that  the  ratios 
of  A,  B,  C  are  rational,  and  the  circle 


(x  —  a)2  +  (y  —  (3)2  =  p2 

(or  x2  +  y2  +  2 gx  +  2  fy  +  c  =  0),  provided  that  cc,  /3,  p  are  rational,  a  condition 
which  implies  that  y,  /,  c  are  rational. 
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Now  in  any  Euclidean  construction  each  new  point  introduced  into  the  figure 
is  determined  as  the  intersection  of  two  lines  or  circles,  or  a  line  and  a  circle. 
But  if  the  coefficients  are  rational,  such  a  pair  of  equations  as 

Ax  +  By  +  C  =  0,  x2  +  y2  +  2  gx  +  2  fy  +  c  =  0 

give,  on  solution,  values  of  x  and  y  of  the  form  m  +  riy/p,  where  m,  n,  p  are 
rational:  for  if  we  substitute  for  x  in  terms  of  y  in  the  second  equation  we  obtain 
a  quadratic  in  y  with  rational  coefficients.  Hence  the  coordinates  of  all  points 
obtained  by  means  of  lines  and  circles  with  rational  coefficients  are  expressible 
by  rational  numbers  and  quadratic  surds.  And  so  the  same  is  true  of  the  distance 
\J (x\  —  X2)2  +  (yi  —  yi)2  between  any  two  points  so  obtained. 

With  the  irrational  distances  thus  constructed  we  may  proceed  to  construct 
a  number  of  lines  and  circles  whose  coefficients  may  now  themselves  involve 
quadratic  surds.  It  is  evident,  however,  that  all  the  lengths  which  we  can  con¬ 
struct  by  the  use  of  such  lines  and  circles  are  still  expressible  by  square  roots 
only,  though  our  surd  expressions  may  now  be  of  a  more  complicated  form.  And 
this  remains  true  however  often  our  constructions  are  repeated.  Hence  Euclidean 
methods  will  construct  any  surd  expression  involving  square  roots  only,  and  no 
others. 

One  of  the  famous  problems  of  antiquity  was  that  of  the  duplication  of 
the  cube,  that  is  to  say  of  the  construction  by  Euclidean  methods  of  a  length 
measured  by  \J‘l.  It  can  be  shown  that  \/2  cannot  be  expressed  by  means  of  any 
finite  combination  of  rational  numbers  and  square  roots,  and  so  that  the  problem 
is  an  impossible  one.  See  Hobson,  Squaring  the  Circle ,  pp.  47  et  seq .;  the  first 
stage  of  the  proof,  viz.  the  proof  that  \f2  cannot  be  a  root  of  a  quadratic  equation 
ax2  +  2 bx  +  c  =  0  with  rational  coefficients,  was  given  in  Ch.  I  (Misc.  Exs.  24). 

23.  Approximate  quadrature  of  the  circle.  Let  O  be  the  centre  of  a 
circle  of  radius  R.  On  the  tangent  at  A  take  AP  =  Ar  anc[  j^q  =  -0/?.  in 
the  same  direction.  On  AO  take  AN  =  OP  and  draw  NM  parallel  to  OQ  and 
cutting  AP  in  M.  Show  that 


AM/R  =  §Vl46, 

and  that  to  take  AM  as  being  equal  to  the  circumference  of  the  circle  would 
lead  to  a  value  of  n  correct  to  five  places  of  decimals.  If  R  is  the  earth’s  radius, 
the  error  in  supposing  AM  to  be  its  circumference  is  less  than  11  yards. 
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24.  Show  that  the  only  lengths  which  can  be  constructed  with  the  ruler  only, 
starting  from  a  given  unit  length,  are  rational  lengths. 

25.  Constructions  for  \/ 2 .  O  is  the  vertex  and  S  the  focus  of  the  parabola 
y2  =  4x,  and  P  is  one  of  its  points  of  intersection  with  the  parabola  x2  =  2 y. 
Show  that  OP  meets  the  latus  rectum  of  the  first  parabola  in  a  point  Q  such 
that  SQ  =  \/2. 

26.  Take  a  circle  of  unit  diameter,  a  diameter  OA  and  the  tangent  at  A. 
Draw  a  chord  OBC  cutting  the  circle  at  B  and  the  tangent  at  C .  On  this  line 
take  OM  =  BC.  Taking  O  as  origin  and  OA  as  axis  of  x,  show  that  the  locus 
of  M  is  the  curve 

(x2  +  y2)x  -  y2  =  0 

(the  Cissoid  of  Diocles).  Sketch  the  curve.  Take  along  the  axis  of  y  a  length 
OD  =  2.  Let  AD  cut  the  curve  in  P  and  OP  cut  the  tangent  to  the  circle  at  A 
in  Q.  Show  that  AQ  =  \/2. 


CHAPTER  III 


COMPLEX  NUMBERS 

34.  Displacements  along  a  line  and  in  a  plane.  The  ‘real  num¬ 
ber’  x,  with  which  we  have  been  concerned  in  the  two  preceding  chapters, 
may  be  regarded  from  many  different  points  of  view.  It  may  be  regarded 
as  a  pure  number,  destitute  of  geometrical  significance,  or  a  geometrical 
significance  may  be  attached  to  it  in  at  least  three  different  ways.  It  may 
be  regarded  as  the  measure  of  a  length ,  viz.  the  length  A0P  along  the  line  A 
of  Chap.  I.  It  may  be  regarded  as  the  mark  of  a  point ,  viz.  the  point  P 
whose  distance  from  A0  is  x.  Or  it  may  be  regarded  as  the  measure  of  a 
displacement  or  change  of  position  on  the  line  A.  It  is  on  this  last  point  of 
view  that  we  shall  now  concentrate  our  attention. 

Imagine  a  small  particle  placed  at  P  on  the  line  A  and  then  displaced 
to  Q.  We  shall  call  the  displacement  or  change  of  position  which  is  needed 
to  transfer  the  particle  from  P  to  Q  the  displacement  PQ.  To  specify  a 
displacement  completely  three  things  are  needed,  its  magnitude ,  its  sense 
forwards  or  backwards  along  the  line,  and  what  may  be  called  its  point  of 
application ,  i.e.  the  original  position  P  of  the  particle.  But,  when  we  are 
thinking  merely  of  the  change  of  position  produced  by  the  displacement, 
it  is  natural  to  disregard  the  point  of  application  and  to  consider  all  dis¬ 
placements  as  equivalent  whose  lengths  and  senses  are  the  same.  Then  the 
displacement  is  completely  specified  by  the  length  PQ  =  x,  the  sense  of 
the  displacement  being  fixed  by  the  sign  of  x.  We  may  therefore,  without 
ambiguity,  speak  of  the  displacement  [x],*  and  we  may  write  PQ  =  [x]. 

We  use  the  square  bracket  to  distinguish  the  displacement  [x]  from  the 
length  or  number  x  J  If  the  coordinate  of  P  is  a,  that  of  Q  will  be  a  +  x; 


*It  is  hardly  necessary  to  caution  the  reader  against  confusing  this  use  of  the  sym¬ 
bol  [x]  and  that  of  Chap.  II  (Exs.  XVI.  and  Misc.  Exs.). 

t Strictly  speaking  we  ought,  by  some  similar  difference  of  notation,  to  distinguish 
the  actual  length  x  from  the  number  x  which  measures  it.  The  reader  will  perhaps  be 
inclined  to  consider  such  distinctions  futile  and  pedantic.  But  increasing  experience  of 
mathematics  will  reveal  to  him  the  great  importance  of  distinguishing  clearly  between 
things  which,  however  intimately  connected,  are  not  the  same.  If  cricket  were  a  math- 
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the  displacement  [x]  therefore  transfers  a  particle  from  the  point  a  to  the 
point  a  +  x. 

We  come  now  to  consider  displacements  in  a  plane.  We  may  define 
the  displacement  PQ  as  before.  But  now  more  data  are  required  in  order 
to  specify  it  completely.  We  require  to  know:  (i)  the  magnitude  of  the 
displacement,  i.e.  the  length  of  the  straight  line  PQ ;  (ii)  the  direction 
of  the  displacement,  which  is  determined  by  the  angle  which  PQ  makes 
with  some  fixed  line  in  the  plane;  (iii)  the  sense  of  the  displacement;  and 
(iv)  its  point  of  application.  Of  these  requirements  we  may  disregard  the 
fourth,  if  we  consider  two  displacements  as  equivalent  if  they  are  the  same 


Fig.  19. 

in  magnitude,  direction,  and  sense.  In  other  words,  if  PQ  and  RS  are 
equal  and  parallel,  and  the  sense  of  motion  from  P  to  Q  is  the  same  as 
that  of  motion  from  R  to  S,  we  regard  the  displacements  PQ  and  RS  as 
equivalent,  and  write 

PQ  =  RS. 

Now  let  us  take  any  pair  of  coordinate  axes  in  the  plane  (such  as 
OX,  OY  in  Fig.  19).  Draw  a  line  OA  equal  and  parallel  to  PQ,  the 
sense  of  motion  from  O  to  A  being  the  same  as  that  from  P  to  Q.  Then 
PQ  and  OA  are  equivalent  displacements.  Let  x  and  y  be  the  coordinates 


ematical  science,  it  would  be  very  important  to  distinguish  between  the  motion  of  the 
batsman  between  the  wickets,  the  run  which  he  scores,  and  the  mark  which  is  put  down 
in  the  score-book. 
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of  A.  Then  it  is  evident  that  OA  is  completely  specified  if  x  and  y  are 
given.  We  call  OA  the  displacement  [x,  y\  and  write 


OA  =  PQ  =  RS  =  [x,y]. 


35.  Equivalence  of  displacements.  Multiplication  of  displace¬ 
ments  by  numbers.  If  £  and  rj  are  the  coordinates  of  P ,  and  £'  and  rf 
those  of  Q,  it  is  evident  that 

X  =  £  -  £,  V  =  rf  -  T). 

The  displacement  from  (£,  ?/)  to  (£' ,r f)  is  therefore 


-v]- 


It  is  clear  that  two  displacements  [x,y],  [x',y'}  are  equivalent  if,  and 
only  if,  x  =  x',  y  =  y'.  Thus  [x,  y\  =  [x',  y']  if  and  only  if 

x  =  x',  y  =  y'.  (1) 

The  reverse  displacement  QP  would  be  [£  —  r /  —  rf],  and  it  is  natural 

to  agree  that 


[Z-g,v-  v'}  =  -[£'-  Z,  v'  -  y\ , 
QP  —  —PQ, 


these  equations  being  really  definitions  of  the  meaning  of  the  symbols 
—  [£'  —  £ ,rf  —  rf,  —PQ.  Having  thus  agreed  that 

~[x,y]  =  [~x,-y\, 

it  is  natural  to  agree  further  that 


a[x,  y\  =  [ax,  ay], 


(2) 
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where  a  is  any  real  number,  positive  or  negative.  Thus  (Fig.  19)  if 
OB  =  —  \OA  then 

OB  =  -\OA  =  -\[x,y]  =  [-\x,-\y\. 

The  equations  (1)  and  (2)  define  the  first  two  important  ideas  connected 
with  displacements,  viz.  equivalence  of  displacements,  and  multiplication 
of  displacements  by  numbers. 

36.  Addition  of  displacements.  We  have  not  yet  given  any  defi¬ 
nition  which  enables  us  to  attach  any  meaning  to  the  expressions 

PQ  +  P'Q\  [x,y]  +  [x',y']. 

Common  sense  at  once  suggests  that  we  should  define  the  sum  of  two 
displacements  as  the  displacement  which  is  the  result  of  the  successive 
application  of  the  two  given  displacements.  In  other  words,  it  suggests 
that  if  QQ\  be  drawn  equal  and  parallel  to  P'Q',  so  that  the  result  of 
successive  displacements  PQ,  P'Q'  on  a  particle  at  P  is  to  transfer  it  first 
to  Q  and  then  to  Q i  then  we  should  define  the  sum  of  PQ  and  P'Q'  as 
being  PQ\.  If  then  we  draw  OA  equal  and  parallel  to  PQ ,  and  OB  equal 
and  parallel  to  P'Q' ,  and  complete  the  parallelogram  OACB ,  we  have 

PQ  +  =  ~PQ[  =  OA  +  ~OB  =  OC. 

Let  us  consider  the  consequences  of  adopting  this  definition.  If  the 
coordinates  of  B  are  x',  y\  then  those  of  the  middle  point  of  AB  are 
\{x  +  x'),  +  y'),  and  those  of  C  are  x  +  x',  y  +  y'.  Hence 

[x,  y]  +  [x',  y']  =  [x  +  x',y  +  y'] ,  (3) 

which  may  be  regarded  as  the  symbolic  definition  of  addition  of  displace¬ 
ments.  We  observe  that 

[x',  y'}  +  [x,  y]  =  [x'  +  x,  y  +  y\ 

=  [x  +  xfy  +  y'}  =  [x,  y\  +  [x',  y'} 
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Fig.  20. 


In  other  words,  addition  of  displacements  obeys  the  commutative  law  ex¬ 
pressed  in  ordinary  algebra  by  the  equation  a+b  =  b+a.  This  law  expresses 
the  obvious  geometrical  fact  that  if  we  move  from  P  first  through  a  dis¬ 
tance  PQ2  equal  and  parallel  to  P'Q1 and  then  through  a  distance  equal 
and  parallel  to  PQ,  we  shall  arrive  at  the  same  point  Q 1  as  before. 

In  particular 

[x,y]  =  [x,0]  +  [0,y].  (4) 

Here  [x,  0]  denotes  a  displacement  through  a  distance  x  in  a  direction  par¬ 
allel  to  OX.  It  is  in  fact  what  we  previously  denoted  by  [x] ,  when  we  were 
considering  only  displacements  along  a  line.  We  call  [x,  0]  and  [0,  y\  the 
components  of  [x,y\,  and  [x,y]  their  resultant. 

When  we  have  once  defined  addition  of  two  displacements,  there  is  no 
further  difficulty  in  the  way  of  defining  addition  of  any  number.  Thus,  by 
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definition, 


[x,y\  +  [x',y']  +  [x",y"}  =  ([x,y]  +  [x\y'])  +  [x",y"] 

r  i  /  -  n  -  r  //  //i  r  i  /  i  tt  i  /  -  //i 

=  [x  +  a;,7/  +  7/J  +  [a;  ,t/  J  =  [x  +  i  +  x  ,  y  +  ?/  +  y  J . 

We  define  subtraction  of  displacements  by  the  equation 

[x,y\  ~  [x',y']  =  [x,y]  +  (-[.r'.i/]),  (5) 

which  is  the  same  thing  as  [x,y]  +  [— x',  —  y']  or  as  [x  —  x',y  —  y'].  In 
particular 

[x,y\  ~  [x,y]  =  [0,0], 

The  displacement  [0,  0]  leaves  the  particle  where  it  was;  it  is  the  zero 
displacement ,  and  we  agree  to  write  [0,  0]  =  0. 


Examples  XX.  1.  Prove  that 

(i)  a[/3x,/3y\  =  j3[ax,ay\  =  [af3x,a/3y], 

(ii)  ([x,  y]  +  [*',  y'})  +  [x'\  y"}  =  [x,  y]  +  ([x',  y’}  +  [*",  y"}), 

(hi)  [x,  y]  +  [x\  y'}  =  [x\  y']  +  [x,  y\, 

(iv)  (a  +  P)[x,y]=a[x,y]+P[x,y], 

(v)  a{[x,y\  +  [x',y']}  =  a[x,  y]  +  a[x',  y']. 

[We  have  already  proved  (iii).  The  remaining  equations  follow  with  equal 
ease  from  the  definitions.  The  reader  should  in  each  case  consider  the  geometrical 
significance  of  the  equation,  as  we  did  above  in  the  case  of  (iii).] 

2.  If  M  is  the  middle  point  of  PQ,  then  OM  =  \{OP  +  OQ).  More 
generally,  if  M  divides  PQ  in  the  ratio  y  :  A,  then 


OM 


A 

A  +  y 


OP  + 


y 

A  +  y 


OQ. 


3.  If  G  is  the  centre  of  mass  of  equal  particles  at  Pi,  P2,  ■  ■  ■ ,  Pn,  then 


OG  =  ( OP 1  +  OP2  +  •  •  •  +  OPn)/n. 
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4.  If  P.  Q ,  R  are  collinear  points  in  the  plane,  then  it  is  possible  to  find 
real  numbers  a,  P,  7,  not  all  zero,  and  such  that 

a  -  OP  +  P  -OQ  +  -r  -  OR  =  0- 

and  conversely.  [This  is  really  only  another  way  of  stating  Ex.  2.] 

5.  If  AB  and  AC  are  two  displacements  not  in  the  same  straight  line,  and 

a-AB  +  /?-IC  =  7-  AB  +  <5-AC, 

then  a  =  7  and  (3  =  5. 

[Take  AB±  =  a  ■  AB ,  AC\  =  ft  ■  AC.  Complete  the  parallelogram  AB\PiC\. 
Then  AP\  =  a  ■  AB  +  /3  •  AC.  It  is  evident  that  ^4Pi  can  only  be  expressed  in 
this  form  in  one  way,  whence  the  theorem  follows.] 

6.  ABCD  is  a  parallelogram.  Through  Q,  a  point  inside  the  parallelogram, 
RQS  and  TQU  are  drawn  parallel  to  the  sides.  Show  that  RU,  TS  intersect 
on  AC . 


DU  C 


[Let  the  ratios  ^4T  :  AB ,  AR  :  AD  be  denoted  by  a,  (3.  Then 


^4T  =  a  ■  AB,  AR 
AU  =  a-AB  +  AD,  AS 


p-AD , 

AB  +  P  ■  AD. 


Let  RU  meet  AC  in  P.  Then,  since  R,  U,  P  are  collinear, 


A 

A  +  /U 


AR  + 


I1 

A  +  fj, 


AU, 


AP  = 


[Ill :  37] 


COMPLEX  NUMBERS 


where  fi/X  is  the  ratio  in  which  P  divides  RU.  That  is  to  say 


AP=^-AB+?±±»AD. 


X  +  y 


A  +  /i 


But  since  P  lies  on  AC,  AP  is  a  numerical  multiple  of  AC;  say 
AP  =  k-AC  =  k-AB  +  k-  AD. 


Hence  (Ex.  5)  ay  =  /3X  +  y  =  (X  +  y)k,  from  which  we  deduce 

k  -  aP 

a  T  f3  —  1 

The  symmetry  of  this  result  shows  that  a  similar  argument  would  also  give 


AP’  = 


af3 

a  T  f3  —  1 


AC, 


if  P'  is  the  point  where  TS  meets  AC.  Hence  P  and  P'  are  the  same  point.] 

7.  ABCD  is  a  parallelogram,  and  M  the  middle  point  of  AB.  Show  that 
DM  trisects  and  is  trisected  by  AC* 


37.  Multiplication  of  displacements.  So  far  we  have  made  no 
attempt  to  attach  any  meaning  whatever  to  the  notion  of  the  product  of  two 
displacements.  The  only  kind  of  multiplication  which  we  have  considered 
is  that  in  which  a  displacement  is  multiplied  by  a  number.  The  expression 

[x,  y]  x  [x\  y'} 

so  far  means  nothing,  and  we  are  at  liberty  to  define  it  to  mean  anything 
we  like.  It  is,  however,  fairly  clear  that  if  any  definition  of  such  a  product 
is  to  be  of  any  use,  the  product  of  two  displacements  must  itself  be  a 
displacement. 

We  might,  for  example,  define  it  as  being  equal  to 

[x  +  x'  ,y  +  y']; 

*The  two  preceding  examples  are  taken  from  Willard  Gibbs’  Vector  Analysis. 
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in  other  words,  we  might  agree  that  the  product  of  two  displacements  was 
to  be  always  equal  to  their  sum.  But  there  would  be  two  serious  objections 
to  such  a  definition.  In  the  first  place  our  definition  would  be  futile.  We 
should  only  be  introducing  a  new  method  of  expressing  something  which 
we  can  perfectly  well  express  without  it.  In  the  second  place  our  definition 
would  be  inconvenient  and  misleading  for  the  following  reasons.  If  a  is  a 
real  number,  we  have  already  defined  a[x,y\  as  [ax,  ay].  Now,  as  we  saw 
in  §  34,  the  real  number  a  may  itself  from  one  point  of  view  be  regarded 
as  a  displacement,  viz.  the  displacement  [a]  along  the  axis  OX,  or,  in  our 
later  notation,  the  displacement  [a,0].  It  is  therefore,  if  not  absolutely 
necessary,  at  any  rate  most  desirable,  that  our  definition  should  be  such 
that 

[a,0][x,y]  =  [ax,  ay], 

and  the  suggested  definition  does  not  give  this  result. 

A  more  reasonable  definition  might  appear  to  be 

[x,y][x',y']  =  [xx',yy']. 

But  this  would  give 

[a,0][x,y]  =  [ax,  0]; 

and  so  this  definition  also  would  be  open  to  the  second  objection. 

In  fact,  it  is  by  no  means  obvious  what  is  the  best  meaning  to  attach  to 
the  product  [x,  y]  [x' ,  y'].  All  that  is  clear  is  (1)  that,  if  our  definition  is  to 
be  of  any  use,  this  product  must  itself  be  a  displacement  whose  coordinates 
depend  on  x  and  y,  or  in  other  words  that  we  must  have 

[x,y][x',y']  =  [X,Y], 

where  X  and  Y  are  functions  of  x,  y,  x' ,  and  y'\  (2)  that  the  definition 
must  be  such  as  to  agree  with  the  equation 


[^,  0]  [a;',  2/']  =  [xx',xy'\, 
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and  (3)  that  the  definition  must  obey  the  ordinary  commutative,  distribu¬ 
tive,  and  associative  laws  of  multiplication,  so  that 


[x,y][x',y']  =  [x',y'][x,y\, 

([x,y]  +  [x\y'])[x\y"}  =  [x,y}[x",y"}  +  [x',  y'][x\  y"], 
[x,y]{[x' ,y'}  +  [x”,y"})  =  [x,y][x',y'\  +  [x,  y]{x",  ?/"], 


and 

[x,  y]  ( [x\  y')  [x",  y"\ )  =  ( [x,  y]  [x\  y'] )  [x",  y") . 

38.  The  right  definition  to  take  is  suggested  as  follows.  We  know  that, 
if  OAB ,  OCD  are  two  similar  triangles,  the  angles  corresponding  in  the 
order  in  which  they  are  written,  then 

OB/OA  =  OD/OC, 

or  OB  ■  OC  =  OA  ■  OD.  This  suggests  that  we  should  try  to  define 
multiplication  and  division  of  displacements  in  such  a  way  that 

OB/OA  =  OD/OC,  OB  OC  =  OA-OD. 


Now  let 


OB=[x,y],  OC  =  [x',y'},  OD  =  [X,Y ], 
and  suppose  that  A  is  the  point  (1,0),  so  that  OA  =  [1,0].  Then 
OA-OB  =  [1,0][X,Y]  =  [X,Y], 

and  so 

lx,y}[x',y'}  =  [X,Y]. 

The  product  OB  ■  OC  is  therefore  to  be  defined  as  OD ,  D  being  obtained 
by  constructing  on  OC  a  triangle  similar  to  OAB.  In  order  to  free  this 
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D 


definition  from  ambiguity,  it  should  be  observed  that  on  OC  we  can  de¬ 
scribe  two  such  triangles,  OCD  and  OCD' .  We  choose  that  for  which  the 
angle  COD  is  equal  to  AOB  in  sign  as  well  as  in  magnitude.  We  say  that 
the  two  triangles  are  then  similar  in  the  same  sense. 

If  the  polar  coordinates  of  B  and  C  are  (p,  9)  and  (a,  0),  so  that 

x  =  pcos9 ,  y  —  p  sin  9\  x'  —  acoscj),  y'  —  a  sin  0, 

then  the  polar  coordinates  of  D  are  evidently  pa  and  9  +  0.  Hence 

X  =  pa  cos (9  +  (ft)  =  xx1  —  yy1, 

Y  =  pa  sin(#  +  0)  =  xy'  +  yx' . 

The  required  definition  is  therefore 

[x,  y]  W,  y)  =  [xx'  -  yy',  xy'  +  yx'}.  (6) 

We  observe  (1)  that  if  y  —  0,  then  X  =  xx' ,  Y  =  xy' ,  as  we  desired; 
(2)  that  the  right-hand  side  is  not  altered  if  we  interchange  x  and  x' ,  and 
y  and  y' ,  so  that 

[x,y][x',y']  =  [x' ,y'][x,y\, 
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and  (3)  that 

{[x,  y\  +  [x',  y']}[x",  y”)  =  [x  +  x',  y  +  y'][x” ,  y") 

=  [(x  +  x')x"  -  (y  +  y’)y",  (x  +  x’)y"  +  (y  +  y')x"} 

=  [xx"  —  yy" ,  xy"  +  yx"]  +  [x'x"  —  y'y",  x'y"  +  y’x") 

=  [x,y][x",y"}  +  [x',y'}{x",y"}. 

Similarly  we  can  verify  that  all  the  equations  at  the  end  of  §  37  are 
satisfied.  Thus  the  definition  (6)  fulfils  all  the  requirements  which  we  made 
of  it  in  5  37. 


Example.  Show  directly  from  the  geometrical  definition  given  above  that 
multiplication  of  displacements  obeys  the  commutative  and  distributive  laws. 
[Take  the  commutative  law  for  example.  The  product  OB  ■  OC  is  OD  (Fig.  22), 
COD  being  similar  to  AOB.  To  construct  the  product  OC  ■  OB  we  should  have 
to  construct  on  OB  a  triangle  BOD\  similar  to  AOC\  and  so  what  we  want  to 
prove  is  that  D  and  D\  coincide,  or  that  BOD  is  similar  to  AOC.  This  is  an 
easy  piece  of  elementary  geometry.] 

39.  Complex  numbers.  Just  as  to  a  displacement  [x]  along  OX 
correspond  a  point  (x)  and  a  real  number  x,  so  to  a  displacement  [x,  y\  in 
the  plane  correspond  a  point  (x,  y)  and  a  pair  of  real  numbers  x,  y. 

We  shall  find  it  convenient  to  denote  this  pair  of  real  numbers  x,  y  by 
the  symbol 

x  +  yi. 

The  reason  for  the  choice  of  this  notation  will  appear  later.  For  the  present 
the  reader  must  regard  x  +  yi  as  simply  another  way  of  writing  [x,  y\.  The 
expression  x  +  yi  is  called  a  complex  number. 

We  proceed  next  to  define  equivalence,  addition,  and  multiplication  of 
complex  numbers.  To  every  complex  number  corresponds  a  displacement. 
Two  complex  numbers  are  equivalent  if  the  corresponding  displacements 
are  equivalent.  The  sum  or  product  of  two  complex  numbers  is  the  complex 
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number  which  corresponds  to  the  sum  or  product  of  the  two  corresponding 
displacements.  Thus 

x  +  yi  =  x!  +  y'i,  (1) 

if  and  only  if  x  —  x',  y  —  y'\ 

(x  +  yi)  +  (x'  +  y'i)  —  (x  +  x')  +  (y  +  y')i;  (2) 

(x  +  yi)(x'  +  y'i)  =  xx'  —  yy'  +  [xy  +  yx')i.  (3) 

In  particular  we  have,  as  special  cases  of  (2)  and  (3), 

x  +  yi  —  (x  +  0  i)  +  (0  +  yi), 

(x  +  0  i)(x'  +  y'i )  =  xx'  +  xy'i; 

and  these  equations  suggest  that  there  will  be  no  danger  of  confusion  if, 
when  dealing  with  complex  numbers,  we  write  x  for  x  +  ()?'  and  yi  for  0  +  yi, 
as  we  shall  henceforth. 

Positive  integral  powers  and  polynomials  of  complex  numbers  are  then 
defined  as  in  ordinary  algebra.  Thus,  by  putting  x  —  x',  y  —  y'  in  (3),  we 
obtain 

(x  +  yi)2  —  (x  +  yi)(x  +  yi)  =  x2  —  y2  +  2  xyi. 

The  reader  will  easily  verify  for  himself  that  addition  and  multiplication 
of  complex  numbers  obey  the  laws  of  algebra  expressed  by  the  equations 

(x  +  yi)  +  (V  +  y'i)  =  ( x  +  y'i)  +  {x  +  yi), 

{(a:  +  yi)  +  (x1  +  y'i)}  +  (x"  +  y"i)  =  (x  +  yi)  +  {(a;7  +  y'i)  +  (x"  +  y"i)}, 
(x  +  yi)(x'  +  y'i)  =  (V  +  y'i)(x  +  yi), 

(x  +  yi){{x'  +  y'i)  +  (x"  +  y"i)}  —  (x  +  yi)(x'  +  y'i)  +  (x  +  yi)(x"  +  y"i), 
{(a;  +  yi)  +  (x'  +  y'i)}(x"  +  y"i)  —  (x  +  yi)(x"  +  y"i)  +  (x'  +  y'i)(x"  +  y"i), 
(x  +  yi){(x'  +  y'i)(x"  +  y"i)}  =  {(x  +  yi)(x'  +  y'i)}(x"  +  y"i), 


the  proofs  of  these  equations  being  practically  the  same  as  those  of  the 
corresponding  equations  for  the  corresponding  displacements. 
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Subtraction  and  division  of  complex  numbers  are  defined  as  in  ordinary 
algebra.  Thus  we  may  define  (x  +  yi)  —  (V  +  y'i )  as 

(x  +  yi)  +  {  —  (V  +  y'i)}  =  x  +  yi  +  (— x'  —  y'i)  —  (x  —  x')  +  (y  —  y7)i; 

or  again,  as  the  number  £  +  yi  such  that 

(a/  +  y'i)  +  (£  +  yi)  =  x  +  yi, 

which  leads  to  the  same  result.  And  (x  +  yi)/{x '  +  y'i)  is  defined  as  being 
the  complex  number  £  +  yi  such  that 

(x'  +  y'i)  (£  +  yi)  =  x  +  yi, 

or 

x‘ 7£  —  y'y  +  (x'y  +  y7£)i  =  a:  +  yi, 
or 

o'f  -  y'v  =  x,  x'y  +  y'£  =  y.  (4) 

Solving  these  equations  for  £  and  y,  we  obtain 

xx'  +  yy'  yx'  —  xy' 

^  /9  i  /9  5  /9  i  /9  * 

x  +  a:  +  y 

This  solution  fails  if  x'  and  y'  are  both  zero,  i.e.  if  x'  +  y'i  =  0.  Thus 
subtraction  is  always  possible;  division  is  always  possible  unless  the  divisor 
is  zero. 

Examples.  (1)  From  a  geometrical  point  of  view,  the  problem  of  the  divi¬ 
sion  of  the  displacement  OB  by  OC  is  that  of  finding  D  so  that  the  triangles 
COB ,  AOD  are  similar,  and  this  is  evidently  possible  (and  the  solution  unique) 
unless  C  coincides  with  0,  or  OC  =  0. 

(2)  The  numbers  x  +  yi,  x  —  yi  are  said  to  be  conjugate.  Verify  that 

(x  +  yi)(x-yi)  =  x2  +  y2 , 

so  that  the  product  of  two  conjugate  numbers  is  real,  and  that 

x  +  yi  (x  +  yi)(x'  —  y'i)  xx'  +  yy'  +  (x'y  —  xy')i 
x'  +  y'i  (x'  +  y'i){x’  —  y'i)  x'2  +  y'2 


[Ill :  41] 


COMPLEX  NUMBERS 


95 


Fig.  23. 


40.  One  most  important  property  of  real  numbers  is  that  known  as 
the  factor  theorem ,  which  asserts  that  the  product  of  two  numbers  cannot 
be  zero  unless  one  of  the  two  is  itself  zero.  To  prove  that  this  is  also  true  of 
complex  numbers  we  put  x  —  0,  y  —  0  in  the  equations  (4)  of  the  preceding 
section.  Then 

x'f  —  y'rj  =  0,  xp  +  y'f  =  0. 

These  equations  give  £  =  0,  rj  =  0,  i.e. 

f  +  yi  =  0, 

unless  x'  —  0  and  y'  =  0,  or  x'  +  y'i  =  0.  Thus  x  +  yi  cannot  vanish  unless 
either  x'  +  y'i  or  £  +  rji  vanishes. 

41.  The  equation  i2  =  —1.  We  agreed  to  simplify  our  notation  by 
writing  x  instead  of  rc  +  0t  and  yi  instead  of  0  +  yi.  The  particular  complex 
number  Vi  we  shall  denote  simply  by  i.  It  is  the  number  which  corresponds 
to  a  unit  displacement  along  OY.  Also 

i2  =  U  =  (0  +  li)(0  +  li)  =  (0  •  0  -  1  •  1)  +  (0  •  1  +  1  •  0)i  =  -1. 

Similarly  (— i)2  =  —1.  Thus  the  complex  numbers  i  and  —i  satisfy  the 
equation  x2  =  —1. 
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The  reader  will  now  easily  satisfy  himself  that  the  upshot  of  the  rules 
for  addition  and  multiplication  of  complex  numbers  is  this,  that  we  oper¬ 
ate  with  complex  numbers  in  exactly  the  same  way  as  with  real  numbers, 
treating  the  symbol  i  as  itself  a  number,  but  replacing  the  product  ii  =  i2 
by  —1  whenever  it  occurs.  Thus,  for  example, 

(x  +  yi)(x'  +  y'i)  =  xx  +  xy'i  +  yx'i  +  yy'i 2 
=  ( xx '  —  yy')  +  (xy'  +  yx')i. 


42. 

Since 


The  geometrical  interpretation  of  multiplication  by  i. 

(x  +  yi)i  =  —  y  +  xi, 


it  follows  that  if  x  +  yi  corresponds  to  OP,  and  OQ  is  drawn  equal  to  OP 
and  so  that  POQ  is  a  positive  right  angle,  then  ( x  +  yi)i  corresponds 
to  OQ.  In  other  words,  multiplication  of  a  complex  number  by  i  turns  the 
corresponding  displacement  through  a  right  angle. 

We  might  have  developed  the  whole  theory  of  complex  numbers  from 
this  point  of  view.  Starting  with  the  ideas  of  x  as  representing  a  displace¬ 
ment  along  OX,  and  of  i  as  a  symbol  of  operation  equivalent  to  turning  x 
through  a  right  angle,  we  should  have  been  led  to  regard  yi  as  a  displace¬ 
ment  of  magnitude  y  along  OY .  It  would  then  have  been  natural  to  define 
x  +  yi  as  in  §§  37  and  40,  and  (x  +  yi)i  would  have  represented  the  dis¬ 
placement  obtained  by  turning  x  +  yi  through  a  right  angle,  i.e.  —y  +  xi. 
Finally,  we  should  naturally  have  defined  (x  +  yi)x'  as  xx'  +  yx'i,  (x  +  yi)y'i 
as  —yy'  +  xy'i ,  and  (x  +  yi)(x'  +  y'i)  as  the  sum  of  these  displacements, 
i.e.  as 

xx'  —  yy'  +  {xy  +  yx')i. 


43.  The  equations  z2  +  1  =  0,  az 2  +  2 bz  +  c  =  0.  There  is  no 
real  number  z  such  that  z2  +  1  =  0;  this  is  expressed  by  saying  that  the 
equation  has  no  real  roots.  But,  as  we  have  just  seen,  the  two  complex 
numbers  i  and  —i  satisfy  this  equation.  We  express  this  by  saying  that  the 
equation  has  the  two  complex  roots  i  and  —i.  Since  i  satisfies  z2  =  —1,  it 
is  sometimes  written  in  the  form  y— 1. 
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Complex  numbers  are  sometimes  called  imaginary.*  The  expression  is 
by  no  means  a  happily  chosen  one,  but  it  is  firmly  established  and  has 
to  be  accepted.  It  cannot,  however,  be  too  strongly  impressed  upon  the 
reader  that  an  ‘imaginary  number’  is  no  more  ‘imaginary’,  in  any  ordinary 
sense  of  the  word,  than  a  ‘real’  number;  and  that  it  is  not  a  number  at 
all,  in  the  sense  in  which  the  ‘real’  numbers  are  numbers,  but,  as  should 
be  clear  from  the  preceding  discussion,  a  pair  of  numbers  ( x,y ),  united 
symbolically,  for  purposes  of  technical  convenience,  in  the  form  x  +  yi. 
Such  a  pair  of  numbers  is  no  less  ‘real’  than  any  ordinary  number  such 
as  or  than  the  paper  on  which  this  is  printed,  or  than  the  Solar  System. 
Thus 

i  —  0  +  li 

stands  for  the  pair  of  numbers  (0, 1),  and  may  be  represented  geometrically 
by  a  point  or  by  the  displacement  [0, 1].  And  when  we  say  that  i  is  a  root 
of  the  equation  z2  +  1  =  0,  what  we  mean  is  simply  that  we  have  defined 
a  method  of  combining  such  pairs  of  numbers  (or  displacements)  which 
we  call  ‘multiplication’,  and  which,  when  we  so  combine  (0, 1)  with  itself, 
gives  the  result  (—1,  0). 

Now  let  us  consider  the  more  general  equation 

az2  +  2  bz  +  c  =  0, 

where  a,  b,  c  are  real  numbers.  If  b 2  >  ac,  the  ordinary  method  of  solution 
gives  two  real  roots 

{—b  ±  Vb2  —  ac}/a. 

If  b 2  <  ac,  the  equation  has  no  real  roots.  It  may  be  written  in  the  form 

{z  +  ( b/a )}2  =  — (ac  —  b2)/a 2 , 

an  equation  which  is  evidently  satisfied  if  we  substitute  for  z  +  {b/a)  either 
of  the  complex  numbers  ±i\/ac  —  b2 /a)  We  express  this  by  saying  that 
the  equation  has  the  two  complex  roots 

{—b  ±  iVac  —  b2}/a. 


*The  phrase  ‘real  number’  was  introduced  as  an  antithesis  to  ‘imaginary  number’. 
^We  shall  sometimes  write  x  +  iy  instead  of  x  +  yi  for  convenience  in  printing. 
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If  we  agree  as  a  matter  of  convention  to  say  that  when  b2  =  ac  (in 
which  case  the  equation  is  satisfied  by  one  value  of  x  only,  viz.  — b/a ),  the 
equation  has  two  equal  roots ,  we  can  say  that  a  quadratic  equation  with  real 
coefficients  has  two  roots  in  all  cases,  either  two  distinct  real  roots,  or  two 
equal  real  roots,  or  two  distinct  complex  roots. 

The  question  is  naturally  suggested  whether  a  quadratic  equation  may 
not,  when  complex  roots  are  once  admitted,  have  more  than  two  roots.  It  is 
easy  to  see  that  this  is  not  possible.  Its  impossibility  may  in  fact  be  proved 
by  precisely  the  same  chain  of  reasoning  as  is  used  in  elementary  algebra 
to  prove  that  an  equation  of  the  nth  degree  cannot  have  more  than  n  real 
roots.  Let  us  denote  the  complex  number  x  +  yi  by  the  single  letter  z,  a 
convention  which  we  may  express  by  writing  z  =  x  +  yi.  Let  f(z)  denote 
any  polynomial  in  z,  with  real  or  complex  coefficients.  Then  we  prove  in 
succession: 

(1)  that  the  remainder,  when  f(z)  is  divided  by  z  —  a,  a  being  any  real 
or  complex  number,  is  /(a); 

(2)  that  if  a  is  a  root  of  the  equation  f(z)  =  0,  then  f(z)  is  divisible 
by  z  —  a; 

(3)  that  if  f(z)  is  of  the  nth  degree,  and  f(z)  =  0  has  the  n  roots  cq, 
$2,  •  •  • ,  an,  then 


f(z)  =  A(z  -  af(z  -a2)...(z-  an), 

where  A  is  a  constant,  real  or  complex,  in  fact  the  coefficient  of  zn  in  f(z). 
From  the  last  result,  and  the  theorem  of  §  40,  it  follows  that  f(z)  cannot 
have  more  than  n  roots. 

We  conclude  that  a  quadratic  equation  with  real  coefficients  has  exactly 
two  roots.  We  shall  see  later  on  that  a  similar  theorem  is  true  for  an 
equation  of  any  degree  and  with  either  real  or  complex  coefficients:  an 
equation  of  the  nth  degree  has  exactly  n  roots.  The  only  point  in  the  proof 
which  presents  any  difficulty  is  the  first,  viz.  the  proof  that  any  equation 
must  have  at  least  one  root.  This  we  must  postpone  for  the  present.*  We 
may,  however,  at  once  call  attention  to  one  very  interesting  result  of  this 


See  Appendix  I. 
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theorem.  In  the  theory  of  number  we  start  from  the  positive  integers  and 
from  the  ideas  of  addition  and  multiplication  and  the  converse  operations 
of  subtraction  and  division.  We  find  that  these  operations  are  not  always 
possible  unless  we  admit  new  kinds  of  numbers.  We  can  only  attach  a 
meaning  to  3  —  7  if  we  admit  negative  numbers,  or  to  |  if  we  admit  rational 
fractions.  When  we  extend  our  list  of  arithmetical  operations  so  as  to 
include  root  extraction  and  the  solution  of  equations,  we  find  that  some 
of  them,  such  as  that  of  the  extraction  of  the  square  root  of  a  number 
which  (like  2)  is  not  a  perfect  square,  are  not  possible  unless  we  widen  our 
conception  of  a  number,  and  admit  the  irrational  numbers  of  Chap.  I. 

Others,  such  as  the  extraction  of  the  square  root  of  —1,  are  not  possible 
unless  we  go  still  further,  and  admit  the  complex  numbers  of  this  chapter. 
And  it  would  not  be  unnatural  to  suppose  that,  when  we  come  to  consider 
equations  of  higher  degree,  some  might  prove  to  be  insoluble  even  by  the  aid 
of  complex  numbers,  and  that  thus  we  might  be  led  to  the  considerations 
of  higher  and  higher  types  of,  so  to  say,  hyper-complex  numbers.  The  fact 
that  the  roots  of  any  algebraical  equation  whatever  are  ordinary  complex 
numbers  shows  that  this  is  not  the  case.  The  application  of  any  of  the 
ordinary  algebraical  operations  to  complex  numbers  will  yield  only  complex 
numbers.  In  technical  language  ‘the  held  of  the  complex  numbers  is  closed 
for  algebraical  operations’. 

Before  we  pass  on  to  other  matters,  let  us  add  that  all  theorems  of 
elementary  algebra  which  are  proved  merely  by  the  application  of  the  rules 
of  addition  and  multiplication  are  true  whether  the  numbers  which  occur 
in  them  are  real  or  complex,  since  the  rules  referred  to  apply  to  complex  as 
well  as  real  numbers.  For  example,  we  know  that,  if  a  and  [3  are  the  roots 
of 

az 2  +  2  bz  +  c  =  0, 

then 

a  +  (3  =  —(26/a),  a/3  =  (c/a). 

Similarly,  if  a,  (3,  7  are  the  roots  of 


az3  +  3  bz2  +  3cz  +  d  =  0, 
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then 

a  +  f3  +  7  =  —  (36/a),  /Ly  +  ya  +  a/3  =  (3c/ a),  a/Ty  —  —  (d/a). 

All  such  theorems  as  these  are  true  whether  a,  b,  . . . ,  a,  (3,  . . .  are  real  or 
complex. 

44.  Argand’s  diagram.  Let  P  (Fig.  24)  be  the  point  (x,y),  r  the 
length  OP,  and  9  the  angle  XOP,  so  that 

x  =  r  cos  9,  y  =  rsin#,  r  =  \J x2  +  y 2 ,  cos  9  :  sin  9  :  1  ::  x  :  y  :  r. 

We  denote  the  complex  number  x  +  yi  by  z,  as  in  §  43,  and  we  call  z  the 
complex  variable.  We  call  P  the  point  z,  or  the  point  corresponding  to  z; 


z  the  argument  of  P,  x  the  real  part,  y  the  imaginary  part,  r  the  modulus, 
and  9  the  amplitude  of  z\  and  we  write 

x  =  R(z),  y  =  I  (z),  r  =  |z|,  9  =  am  z. 


When  y  —  0  we  say  that  z  is  real,  when  x  =  0  that  z  is  purely  imag¬ 
inary.  Two  numbers  x  +  yi,  x  —  yi  which  differ  only  in  the  signs  of  their 
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imaginary  parts,  we  call  conjugate.  It  will  be  observed  that  the  sum  2x 
of  two  conjugate  numbers  and  their  product  x2  +  y2  are  both  real,  that 
they  have  the  same  modulus  \J x1  +  y2  and  that  their  product  is  equal  to 
the  square  of  the  modulus  of  either.  The  roots  of  a  quadratic  with  real 
coefficients,  for  example,  are  conjugate,  when  not  real. 

It  must  be  observed  that  6  or  am  z  is  a  many- valued  function  of  x  and  y, 
having  an  infinity  of  values,  which  are  angles  differing  by  multiples  of  2n* 
A  line  originally  lying  along  OX  will,  if  turned  through  any  of  these  angles, 
come  to  lie  along  OP.  We  shall  describe  that  one  of  these  angles  which 
lies  between  —  n  and  7r  as  the  principal  value  of  the  amplitude  of  z.  This 
definition  is  unambiguous  except  when  one  of  the  values  is  7 r,  in  which  case 
—7 r  is  also  a  value.  In  this  case  we  must  make  some  special  provision  as  to 
which  value  is  to  be  regarded  as  the  principal  value.  In  general,  when  we 
speak  of  the  amplitude  of  z  we  shall,  unless  the  contrary  is  stated,  mean 
the  principal  value  of  the  amplitude. 

Fig.  24  is  usually  known  as  Argand’s  diagram. 

45.  De  Moivre’s  Theorem.  The  following  statements  follow  im¬ 
mediately  from  the  definitions  of  addition  and  multiplication. 

(1)  The  real  (or  imaginary)  part  of  the  sum  of  two  complex  numbers 
is  equal  to  the  sum  of  their  real  (or  imaginary)  parts. 

(2)  The  modulus  of  the  product  of  two  complex  numbers  is  equal  to 
the  product  of  their  moduli. 

(3)  The  amplitude  of  the  product  of  two  complex  numbers  is  either 
equal  to  the  sum  of  their  amplitudes,  or  differs  from  it  by  27T. 

It  should  be  observed  that  it  is  not  always  true  that  the  principal  value 
of  am {zz')  is  the  sum  of  the  principal  values  of  am  z  and  am  z' .  For  example, 
if  2  =  z'  =  —  1  +  *,  then  the  principal  values  of  the  amplitudes  of  z  and  z'  are 
each  1 7 r.  But  zz'  =  —2 i,  and  the  principal  value  of  am(zz')  is  —  ^7r  and  not  |7 r. 


*It  is  evident  that  \z\  is  identical  with  the  polar  coordinate  r  of  P ,  and  that  the 
other  polar  coordinate  6  is  one  value  of  am  z.  This  value  is  not  necessarily  the  principal 
value,  as  defined  below,  for  the  polar  coordinate  of  §  22  lies  between  0  and  27t,  and  the 
principal  value  between  —  7r  and  7r. 
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The  two  last  theorems  may  be  expressed  in  the  equation 

r(cosd  +  isin#)  x  p(cos0  +  ism  (ft)  =  rp{cos(9  +  (ft)  +  isin(#  +  0)}, 

which  may  be  proved  at  once  by  multiplying  out  and  using  the  ordinary 
trigonometrical  formulae  for  cos(d  +  (ft)  and  sin(d  +  (ft).  More  generally 

ri(cos#i  +  isin#i)  x  r2(cos#2  +  isin#2)  x  . . .  x  rn(cos 9n  +  isin#n) 

—  rpr2  . .  .r n{cos(0i  +  9-2  +  •  •  •  +  9n)  +  i  sin($!  +  02  +  •  •  •  +  0n)}. 


A  particularly  interesting  case  is  that  in  which 

n  =  r2  =  ■  ■  ■  =  rn  =  1,  9i  =  92  =  ■  ■  ■  =  9n  =  9. 


We  then  obtain  the  equation 

(cos  9  +  i  sin  9)n  =  cos  n9  +  i  sin  n9, 


where  n  is  any  positive  integer:  a  result  known  as  De  Moivre’s  Theorem,.* 
Again,  if 

z  =  r(cos  9  +  i  sin  9) 

then 

1/z  =  (cos  9  —  i  sin  9)  jr. 

Thus  the  modulus  of  the  reciprocal  of  z  is  the  reciprocal  of  the  modulus 
of  z,  and  the  amplitude  of  the  reciprocal  is  the  negative  of  the  amplitude 
of  z.  We  can  now  state  the  theorems  for  quotients  which  correspond  to 
(2)  and  (3). 

(4)  The  modulus  of  the  quotient  of  two  complex  numbers  is  equal  to 
the  quotient  of  their  moduli. 

(5)  The  amplitude  of  the  quotient  of  two  complex  numbers  either  is 
equal  to  the  difference  of  their  amplitudes,  or  differs  from  it  by  2tt. 

*It  will  sometimes  be  convenient,  for  the  sake  of  brevity,  to  denote  cos  9  +  i  sin# 
by  Cis  9:  in  this  notation,  suggested  by  Profs.  Harkness  and  Morley,  De  Moivre’s  theo¬ 
rem  is  expressed  by  the  equation  (Cis0)n  =  Cisnd. 
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Again 


(cos  9  +  i  sin  6)~n  =  (cos  9  —  i  sin  9)n 

=  {cos( — 0)  +isin(— 9)}n 
=  cos  (—n9)  +  isin(— n9). 

Hence  De  Moivre ’s  Theorem  holds  for  all  integral  values  of  n,  positive  or 
negative. 

To  the  theorems  (l)-(5)  we  may  add  the  following  theorem,  which  is 
also  of  very  great  importance. 

(6)  The  modulus  of  the  sum  of  any  number  of  complex  numbers  is  not 
greater  than  the  sum  of  their  moduli. 


Let  OP,  OP',  ...  be  the  displacements  corresponding  to  the  various 
complex  numbers.  Draw  PQ  equal  and  parallel  to  OP',  QR  equal  and 
parallel  to  OP" ,  and  so  on.  Finally  we  reach  a  point  U,  such  that 


OU  =  OP  +  OP’  +  OP"  +  .... 
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The  length  OU  is  the  modulus  of  the  sum  of  the  complex  numbers,  whereas 
the  sum  of  their  moduli  is  the  total  length  of  the  broken  line  OPQR  . . .  U, 
which  is  not  less  than  OU. 

A  purely  arithmetical  proof  of  this  theorem  is  outlined  in  Exs.  XXI.  1. 

46.  We  add  some  theorems  concerning  rational  functions  of  complex 
numbers.  A  rational  function  of  the  complex  variable  z  is  defined  exactly 
as  is  a  rational  function  of  a  real  variable  x,  viz.  as  the  quotient  of  two 
polynomials  in  0. 

Theorem  1.  Any  rational  function  R(z)  can  be  reduced  to  the  form 
X  +  Yi,  where  X  and  Y  are  rational  functions  of  x  and  y  with  real  coeffi¬ 
cients. 

In  the  first  place  it  is  evident  that  any  polynomial  P(x  +  yi)  can  be 
reduced,  in  virtue  of  the  definitions  of  addition  and  multiplication,  to  the 
form  A  +  Bi,  where  A  and  B  are  polynomials  in  x  and  y  with  real  coeffi¬ 
cients.  Similarly  Q(x  +  yi)  can  be  reduced  to  the  form  C  +  Di.  Hence 

R(x  +  yi)  =  P(x  +  yi)/Q(x  +  yi) 
can  be  expressed  in  the  form 

(A  +  Bi)/{C  +  Di)  =  (A  +  Bi)(C  -  Di)/(C  +  Di){C  -  Di) 

AC  +  BD  |  BC-AD . 

~  C2  +  D 2  +  C 2  +  D2 


which  proves  the  theorem. 

Theorem  2.  If  R(x  +  yi)  =  X  +  Yi,  R  denoting  a  rational  function 
as  before,  but  with  real  coefficients,  then  R(x  —  yi)  =  X  —  Yi. 

In  the  first  place  this  is  easily  verified  for  a  power  (. x+yi)n  by  actual  ex¬ 
pansion.  It  follows  by  addition  that  the  theorem  is  true  for  any  polynomial 
with  real  coefficients.  Hence,  in  the  notation  used  above, 

A  —  Bi  _  AC  +  BD  BC-AD. 

C  -Di  ~  C2  +  D 2  C 2  +  D2  h 


R(x  —  yi) 
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the  reduction  being  the  same  as  before  except  that  the  sign  of  i  is  changed 
throughout.  It  is  evident  that  results  similar  to  those  of  Theorems  1  and  2 
hold  for  functions  of  any  number  of  complex  variables. 

Theorem  3.  The  roots  of  an  equation 

aoZn  T  Q\Zn  T  ■  ■  ■  T  an  —  0, 

whose  coefficients  are  real,  may,  in  so  far  as  they  are  not  themselves  real, 
be  arranged  in  conjugate  pairs. 

For  it  follows  from  Theorem  2  that  i f  x  +  yi  is  a  root  then  so  is  x  —  yi. 
A  particular  case  of  this  theorem  is  the  result  (§  43)  that  the  roots  of  a 
quadratic  equation  with  real  coefficients  are  either  real  or  conjugate. 

This  theorem  is  sometimes  stated  as  follows:  in  an  equation  with  real 
coefficients  complex  roots  occur  in  conjugate  pairs.  It  should  be  compared 
with  the  result  of  Exs.  VIII.  7,  which  may  be  stated  as  follows:  in  an 
equation  with  rational  coefficients  irrational  roots  occur  in  conjugate  pairs* 

Examples  XXI.  1.  Prove  theorem  (6)  of  §  45  directly  from  the  defini¬ 
tions  and  without  the  aid  of  geometrical  considerations. 

[First,  to  prove  that  \z  +  z'\  fj  \z\  +  l^l  is  to  prove  that 

( x  +  x')2  +  (y  +  y')2  ^  { i/x2  +  y2  +  \/ x'2  +  y'2}2. 

The  theorem  is  then  easily  extended  to  the  general  case.] 

2.  The  one  and  only  case  in  which 

\z\  +  | z'\  +  •  •  •  =  \z  +  z1  +  .  .  .  |, 

is  that  in  which  the  numbers  z,  z' ,  ...  have  all  the  same  amplitude.  Prove  this 
both  geometrically  and  analytically. 

3.  The  modulus  of  the  sum  of  any  number  of  complex  numbers  is  not  less 
than  the  sum  of  their  real  (or  imaginary)  parts. 

4.  If  the  sum  and  product  of  two  complex  numbers  are  both  real,  then  the 
two  numbers  must  either  be  real  or  conjugate. 

*The  numbers  a  +  Vb ,  a  —  Vb,  where  a,  b  are  rational,  are  sometimes  said  to  be 
‘conjugate’. 
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5.  If 

a  +  bV 2  +  (c  +  dV2)i  =  A  +  B\f2  +  (C  +  D\/2)i, 
where  a,  b,  c,  d.  A,  B,  C,  D  are  real  rational  numbers,  then 

a  =  A,  b  =  B,  c  =  C,  d  =  D. 


6.  Express  the  following  numbers  in  the  form  A  +  Bi,  where  A  and  B  are 
real  numbers: 

2  (1  +  i\2  f1  A  +  ^  ( X+ju\ 2  /A  -  yi\2 

\  1  —  i )  ’  \  1  +  i )  ’  A  —  (xi  \X  —  fii)  \X  +  /j,i)  ’ 

where  A  and  /j  are  real  numbers. 

7.  Express  the  following  functions  of  z  =  x  +  yi  in  the  form  X  +  Yi,  where 
X  and  Y  are  real  functions  of  x  and  y:  z2 ,  z 3,  zn,  1/z,  z+(\/z),  (a+f3z)/(j+5z), 
where  a,  f3,  7,  5  are  real  numbers. 

8.  Find  the  moduli  of  the  numbers  and  functions  in  the  two  preceding 
examples. 

9.  The  two  lines  joining  the  points  z  =  a,  z  =  b  and  z  =  c,  z  =  d  will  be 
perpendicular  if 

am  (^)  =  ±b’ 

i.e.  if  (a  —  b)/(c  —  d)  is  purely  imaginary.  What  is  the  condition  that  the  lines 
should  be  parallel? 

10.  The  three  angular  points  of  a  triangle  are  given  by  z  =  a,  z  =  f3,  z  =  7, 
where  a,  (3,  7  are  complex  numbers.  Establish  the  following  propositions: 

(i)  the  centre  of  gravity  is  given  by  z  =  ^(a  +  (3  +  7) ; 

(ii)  the  circum-centre  is  given  by  \z  —  a\  =  \z  —  /3|  =  \z  —  7I; 

(iii)  the  three  perpendicidars  from  the  angular  points  on  the  opposite  sides 
meet  in  a  point  given  by 


R 


z  —  a 


13  ~  7 


=  R 


z  —  (3 

7  —  a 


=  R 


2  —  7 

a  —  (3 


=  0: 


(iv)  there  is  a  point  P  inside  the  triangle  such  that 


CBP  =  ACP  =  BAP  =  u, 
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and 

cot  co  =  cot  A  +  cot  B  +  cot  C. 

[To  prove  (iii)  we  observe  that  if  A,  B,  C  are  the  vertices,  and  P  any  point  2, 
then  the  condition  that  AP  should  be  perpendicular  to  BC  is  (Ex.  9)  that 
(z  —  a)/ (j3  —  7)  should  be  purely  imaginary,  or  that 

R (z  -  a)  R(/3  -  7)  +  I (z  -  a)  I{/3  -  7)  =  0. 

This  equation,  and  the  two  similar  equations  obtained  by  permuting  a,  (3 ,  7 
cyclically,  are  satisfied  by  the  same  value  of  2,  as  appears  from  the  fact  that  the 
sum  of  the  three  left-hand  sides  is  zero. 

To  prove  (iv),  take  BC  parallel  to  the  positive  direction  of  the  axis  of  x. 
Then* 

7  —  P  =  a,  a  —  7  =  —  6Cis(— C),  f3  —  a  =  —cCisB. 

We  have  to  determine  2  and  co  from  the  equations 

(2  -  op  (ftp  -  tap )  =  (2  -  P){  70  -  jgo)  =  (2  -  7)(«0  -  70)  = 

(20  —  ao)((3  —  a)  (20  -  /30)  (7  -  P)  (zo-7o)(a-7) 

where  20,  «o,  Po,  70  denote  the  conjugates  of  2,  a,  (3,  7. 

Adding  the  numerators  and  denominators  of  the  three  equal  fractions,  and 
using  the  equation 

i  cot  co  =  (1  +  Cis2w)/(1  —  Cis2w), 

we  find  that 

•  .  (P  ~  7)  (A)  -  7o)  +  (7  -  «)  (70  -  ao)  +  (a  -  P){&o  -  /?o) 

zcota;  = - - - - - - - - - . 

P 7o  -  Po7  +  7«o  -  7o«  +  apo  -  «oP 

From  this  it  is  easily  deduced  that  the  value  of  cot  co  is  ( a 2  +  b2  +  c2)/ 4 A,  where 
A  is  the  area  of  the  triangle;  and  this  is  equivalent  to  the  result  given. 

To  determine  2,  we  multiply  the  numerators  and  denominators  of  the  equal 
fractions  by  (70  -Po)/(/3-a),  (a0  -  7o)/(7~  P),  (Po  ~  «o)/(«  -  7),  and  add  to 
form  a  new  fraction.  It  will  be  found  that 

aa  Cis  A  +  6/3  Cis  B  +  07  Cis  C 
'  =  a  Cis  A  +  b  Cis  B  +  c  Cis  C 

*We  suppose  that  as  we  go  round  the  triangle  in  the  direction  ABC  we  leave  it  on 
our  left. 
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11.  The  two  triangles  whose  vertices  are  the  points  a,  6,  c  and  x,  y,  z  re¬ 
spectively  will  be  similar  if 


1  1 

a  b 
x  y 


1 


c 

z 


=  0 


[The  condition  required  is  that  AB / AC  =  XY / XZ  (large  letters  denoting 
the  points  whose  arguments  are  the  corresponding  small  letters),  or 
(6  —  a)/ (c  —  a)  =  (y  —  x)/(z  —  x),  which  is  the  same  as  the  condition  given.] 

12.  Deduce  from  the  last  example  that  if  the  points  x,  y,  z  are  collinear  then 
we  can  find  real  numbers  a,  f3,  7  such  that  a  +  (3  +  7  =  0  and  ax  +  f3y  +  72  =  0, 
and  conversely  (cf.  Exs.  xx.  4).  [Use  the  fact  that  in  this  case  the  triangle  formed 
by  x ,  y,  z  is  similar  to  a  certain  line-triangle  on  the  axis  OX,  and  apply  the 
result  of  the  last  example.] 

13.  The  general  linear  equation  with  complex  coefficients.  The  equa¬ 
tion  az  +  (3  =  0  has  the  one  solution  2  =  —(/3/a),  unless  a  =  0.  If  we  put 


a  =  a  +  Ai,  /3  =  b  +  Bi,  z  =  x  +  yi, 


and  equate  real  and  imaginary  parts,  we  obtain  two  equations  to  determine  the 
two  real  numbers  x  and  y.  The  equation  will  have  a  real  root  if  y  =  0,  which 
gives  ax  +  b  =  0,  Ax  +  B  =  0,  and  the  condition  that  these  equations  should  be 
consistent  is  aB  —  bA  =  0. 

14.  The  general  quadratic  equation  with  complex  coefficients.  This 
equation  is 

(a  +  Ai)z2  +  2(6  +  Bi)z  +  (c  +  Ci )  =  0. 

Unless  a  and  A  are  both  zero  we  can  divide  through  by  a  +  iA.  Hence  we 
may  consider 

z2  +  2(6  +  Bi)z  +  (c  +  Ci)  =  0  (1) 

as  the  standard  form  of  our  equation.  Putting  z  =  x  +  yi  and  equating  real  and 
imaginary  parts,  we  obtain  a  pair  of  simultaneous  equations  for  x  and  y,  viz. 

x2  —  y2  +  2(bx  —  By)  +  c  =  0,  2  xy  +  2(by  +  Bx)  +  C  =  0. 

If  we  put 

x  +  6  =  £,  y  +  B  =  rj,  b2  —  B2  —  c  =  h,  2bB  —  C  =  k, 
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these  equations  become 

£2  —  if  =  h,  2  £77  =  k. 

Squaring  and  adding  we  obtain 

f2  +  rf  =  \J  h 2  +  k2,  £  =  ±  -y/ +  k2  +  /r},  i]  =  ±  y/  -  ^}- 

We  must  choose  the  signs  so  that  £7/  has  the  sign  of  k:  i.e.  if  k  is  positive  we 
must  take  like  signs,  if  k  is  negative  unlike  signs. 

Conditions  for  equal  roots.  The  two  roots  can  only  be  equal  if  both  the 
square  roots  above  vanish,  i.e.  if  h  =  0,  k  =  0,  or  if  c  =  b2  —  B2,  C  =  2 bB. 
These  conditions  are  equivalent  to  the  single  condition  c  +  Ci  =  (6  + Hi)2,  which 
obviously  expresses  the  fact  that  the  left-hand  side  of  (1)  is  a  perfect  square. 

Condition  for  a  real  root.  If  x2  +  2(b+Bi)x  +  {c+Ci)  =  0,  where  x  is  real, 
then  x 2  +  2 bx  +  c  =  0,  2 Bx  +  (7  =  0.  Eliminating  x  we  find  that  the  required 
condition  is 

C2  -  AbBC  +  4cH2  =  0. 

Condition  for  a  purely  imaginary  root.  This  is  easily  found  to  be 

C2  -  AbBC  -  4 b2c  =  0. 

Conditions  for  a  pair  of  conjugate  complex  roots.  Since  the  sum  and 
the  product  of  two  conjugate  complex  numbers  are  both  real,  b  +  Bi  and  c  +  Ci 
must  both  be  real,  i.e.  B  =  0,  C  =  0.  Thus  the  equation  (1)  can  have  a  pair  of 
conjugate  complex  roots  only  if  its  coefficients  are  real.  The  reader  should  verify 
this  conclusion  by  means  of  the  explicit  expressions  of  the  roots.  Moreover,  if 
b2  ^  c,  the  roots  will  be  real  even  in  this  case.  Hence  for  a  pair  of  conjugate 
roots  we  must  have  B  =  0,  C  =  0,  b2  <  c. 

15.  The  Cubic  equation.  Consider  the  cubic  equation 

z3  +  3  Hz  +  (7  =  0, 

where  G  and  H  are  complex  numbers,  it  being  given  that  the  equation  has 
(a)  a  real  root,  ( b )  a  purely  imaginary  root,  (c)  a  pair  of  conjugate  roots.  If 
H  =  A  +  pi,  G  =  p  +  cri,  we  arrive  at  the  following  conclusions. 


[Ill :  46] 


COMPLEX  NUMBERS 


110 


(a)  Conditions  for  a  real  root.  If  p  is  not  zero,  then  the  real  root 
is  — cr/ 3/u,,  and  o"3  +  27A p2cr  —  27 p3  p  =  0.  On  the  other  hand,  if  p  =  0  then  we 
must  also  have  a  =  0,  so  that  the  coefficients  of  the  equation  are  real.  In  this 
case  there  may  be  three  real  roots. 

( b )  Conditions  for  a  purely  imaginary  root.  If  p  is  not  zero  then  the 
purely  imaginary  root  is  (p/3p)i,  and  p3  —  27A p2p  —  27 p3a  =  0.  If  p  =  0  then 
also  p  =  0,  and  the  root  is  yi ,  where  y  is  given  by  the  equation  y 3  —  3A y  —  a  =  0, 
which  has  real  coefficients.  In  this  case  there  may  be  three  purely  imaginary 
roots. 

(c)  Conditions  for  a  pair  of  conjugate  complex  roots.  Let  these  be  x+yi 
and  x  —  yi.  Then  since  the  sum  of  the  three  roots  is  zero  the  third  root  must 
be  —2x.  From  the  relations  between  the  coefficients  and  the  roots  of  an  equation 
we  deduce 

y2  —  3x2  =  3  H,  2x{x2  +  y2)  =  G. 

Hence  G  and  H  must  both  be  real. 

In  each  case  we  can  either  find  a  root  (in  which  case  the  equation  can  be 
reduced  to  a  quadratic  by  dividing  by  a  known  factor)  or  we  can  reduce  the 
solution  of  the  equation  to  the  solution  of  a  cubic  equation  with  real  coefficients. 

16.  The  cubic  equation  x3+aix2+a2X+a3  =  0,  where  ai  =  . . . ,  has 

a  pair  of  conjugate  complex  roots.  Prove  that  the  remaining  root  is  —A^a^/A'^, 
unless  Hg  =  0.  Examine  the  case  in  which  Ar3  =  0. 

17.  Prove  that  if  z3  +  3 Hz  +  G  =  0  has  two  complex  roots  then  the  equation 

8a3  +  6  aH  —  G  =  0 

has  one  real  root  which  is  the  real  part  a  of  the  complex  roots  of  the  original 
equation;  and  show  that  a  has  the  same  sign  as  G. 

18.  An  equation  of  any  order  with  complex  coefficients  will  in  general  have 
no  real  roots  nor  pairs  of  conjugate  complex  roots.  How  many  conditions  must 
be  satisfied  by  the  coefficients  in  order  that  the  equation  should  have  (a)  a  real 
root,  ( b )  a  pair  of  conjugate  roots? 
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19.  Coaxal  circles.  In  Fig.  26,  let  a,  b,  z  be  the  arguments  of  A,  B,  P. 
Then 

z  —  b 

am - =  APB , 

z  —  a 

if  the  principal  value  of  the  amplitude  is  chosen.  If  the  two  circles  shown  in  the 
figure  are  equal,  and  z',  z±,  z[  are  the  arguments  of  P',  P\,  P{,  and  APB  =  6 , 
it  is  easy  to  see  that 


z'  -b 

am -  =  ir  —  9, 

z'  —  a 


am 


zi  —  b 
z\  —  a 


and 


z[  —  b 

am  — -  =  —7 r  +  0. 


z\  —  a 


The  locus  defined  by  the  equation 


z~b  0 
am - =  c/, 

z  —  a 


where  0  is  constant,  is  the  arc  APB.  By  writing  ir  —  6,  —6,  —ir  +  9  for  6,  we 
obtain  the  other  three  arcs  shown. 

The  system  of  equations  obtained  by  supposing  that  6  is  a  parameter,  varying 
from  — 7T  to  +7r,  represents  the  system  of  circles  which  can  be  drawn  through  the 
points  A,  B.  It  should  however  be  observed  that  each  circle  has  to  be  divided 
into  two  parts  to  which  correspond  different  values  of  9. 

20.  Now  let  us  consider  the  equation 


where  A  is  a  constant. 

Let  K  be  the  point  in  which  the  tangent  to  the  circle  ABP  at  P  meets  AB. 
Then  the  triangles  KPA,  KBP  are  similar,  and  so 

AP/PB  =  PK/BK  =  KA/KP  =  A. 

Hence  KA/KB  =  A2,  and  therefore  K  is  a  fixed  point  for  all  positions  of  P 
which  satisfy  the  equation  (1).  Also  KP 2  =  KA  ■  KB ,  and  so  is  constant. 
Hence  the  locus  of  P  is  a  circle  whose  centre  is  K. 
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The  system  of  equations  obtained  by  varying  A  represents  a  system  of  circles, 
and  every  circle  of  this  system  cuts  at  right  angles  every  circle  of  the  system  of 
Ex.  19. 

The  system  of  Ex.  19  is  called  a  system  of  coaxal  circles  of  the  common  point 
kind.  The  system  of  Ex.  20  is  called  a  system  of  coaxal  circles  of  the  limiting 
point  kind ,  A  and  B  being  the  limiting  points  of  the  system.  If  A  is  very  large  or 
very  small  then  the  circle  is  a  very  small  circle  containing  A  or  B  in  its  interior. 

21.  Bilinear  Transformations.  Consider  the  equation 

z  =  Z  +  a,  (1) 

where  z  =  x  +  yi  and  Z  =  X  +  Yi  are  two  complex  variables  which  we  may  sup¬ 
pose  to  be  represented  in  two  planes  xoy,  XOY .  To  every  value  of  z  corresponds 
one  of  Z.  and  conversely.  If  a  =  a  +  fh  then 


x  =  X  +  a,  y  =  Y  +  /3, 
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and  to  the  point  (x,y)  corresponds  the  point  (X,  Y).  If  (x,y)  describes  a  curve 
of  any  kind  in  its  plane,  ( X ,  Y)  describes  a  curve  in  its  plane.  Thus  to  any  figure 
in  one  plane  corresponds  a  figure  in  the  other.  A  passage  of  this  kind  from  a 
figure  in  the  plane  xoy  to  a  figure  in  the  plane  XOY  by  means  of  a  relation  such 
as  (1)  between  z  and  Z  is  called  a  transformation.  In  this  particular  case  the 
relation  between  corresponding  figures  is  very  easily  defined.  The  (X,  Y)  figure 
is  the  same  in  size,  shape,  and  orientation  as  the  (x,  y)  figure,  but  is  shifted  a 
distance  a  to  the  left,  and  a  distance  /?  downwards.  Such  a  transformation  is 
called  a  translation. 

Now  consider  the  equation 

z  =  pZ,  (2) 

where  p  is  real.  This  gives  x  =  pX,  y  =  pY .  The  two  figures  are  similar 
and  similarly  situated  about  their  respective  origins,  but  the  scale  of  the  (x,  y) 
figure  is  p  times  that  of  the  (X,  Y)  figure.  Such  a  transformation  is  called  a 
magnification. 

Finally  consider  the  equation 

z  =  (cos  (j)  +  i  sin  <j>)Z.  (3) 

It  is  clear  that  \z\  =  \Z\  and  that  one  value  of  am z  is  am Z  +  and  that  the 
two  figures  differ  only  in  that  the  (x,  y)  figure  is  the  (X,  Y)  figure  turned  about 
the  origin  through  an  angle  f>  in  the  positive  direction.  Such  a  transformation 
is  called  a  rotation. 

The  general  linear  transformation 

z  =  aZ  +  b  (4) 

is  a  combination  of  the  three  transformations  (1),  (2),  (3).  For,  if  |a|  =  p  and 
am  a  =  (j),  we  can  replace  (4)  by  the  three  equations 

z  =  z'  +  b,  z!  =  pZ\  Z'  =  (cos  4>  +  i  sin  <f)Z. 

Thus  the  general  linear  transformation  is  equivalent  to  the  combination  of  a 
translation,  a  magnification,  and  a  rotation. 

Next  let  us  consider  the  transformation 


z  =  l/Z. 


(5) 
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If  \Z\  =  R  and  amZ  =  0,  then  |^;|  =  1/R  and  am  2  =  —0,  and  to  pass  from 
the  (x,  y )  figure  to  the  ( X ,  Y)  figure  we  invert  the  former  with  respect  to  o,  with 
unit  radius  of  inversion,  and  then  construct  the  image  of  the  new  figure  in  the 
axis  ox  ( i.e .  the  symmetrical  figure  on  the  other  side  of  ox). 

Finally  consider  the  transformation 


aZ  +  b 
cZ  +  d 


(6) 


This  is  equivalent  to  the  combination  of  the  transformations 


2  =  (a/c)  +  (be  —  ad)(z' /c),  d  =  1  /Z' ,  Z'  =  cZ  +  d, 


i.e.  to  a  certain  combination  of  transformations  of  the  types  already  considered. 

The  transformation  (6)  is  called  the  general  bilinear  transformation.  Solving 
for  Z  we  obtain 

Z=  tzt. 

cz  —  a 

The  general  bilinear  transformation  is  the  most  general  type  of  transforma¬ 
tion  for  which  one  and  only  one  value  of  2  corresponds  to  each  value  of  Z .  and 
conversely. 


22.  The  general  bilinear  transformation  transforms  circles  into  circles. 
This  may  be  proved  in  a  variety  of  ways.  We  may  assume  the  well-known 
theorem  in  pure  geometry,  that  inversion  transforms  circles  into  circles  (which 
may  of  course  in  particular  cases  be  straight  lines).  Or  we  may  use  the  results 
of  Exs.  19  and  20.  If,  e.g .,  the  (x,  y)  circle  is 

\(z  -  o)/(z  -  p)\  =  A, 

and  we  substitute  for  2  in  terms  of  Z,  we  obtain 

\(Z-a')/(Z-p')\  =  \', 

where 

,  b  —  ad  ,  b  —  pd  , 

a  = - ,  p  = - ,  A  = 

a  —  ac  a  —  pc 

23.  Consider  the  transformations  2  =  1  /Z,  z  =  (1  +  Z)/(\  —  Z ),  and  draw 
the  (X,  Y)  curves  which  correspond  to  (1)  circles  whose  centre  is  the  origin, 
(2)  straight  lines  through  the  origin. 


a  —  pc 


a  —  ac 


A. 
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24.  The  condition  that  the  transformation  z  =  ( aZ  +  6)/ (cZ  +  d)  should 
make  the  circle  x2  +  y2  =  1  correspond  to  a  straight  line  in  the  ( X ,  Y)  plane  is 
|a|  =  |c|. 

25.  Cross  ratios.  The  cross  ratio  (21,22523,24)  is  defined  to  be 

(zi  -  z3)(z2  -  24) 

(21  -  24)  (22  -  23)' 

If  the  four  points  21,  22,  23,  24  are  on  the  same  line,  this  definition  agrees 
with  that  adopted  in  elementary  geometry.  There  are  24  cross  ratios  which  can 
be  formed  from  21,  2 2,  23,  24  by  permuting  the  suffixes.  These  consist  of  six 
groups  of  four  equal  cross  ratios.  If  one  ratio  is  A,  then  the  six  distinct  cross 
ratios  are  A,  1  —  A,  1/A,  1/(1  —  A),  (A  —  1)/A,  A/(A  —  1).  The  four  points  are 
said  to  be  harmonic  or  harmonically  related  if  any  one  of  these  is  equal  to  —  1. 
In  this  case  the  six  ratios  are  —1,  2,  —1,  4,  2,  1. 

If  any  cross  ratio  is  real  then  all  are  real  and  the  four  points  lie  on  a  circle. 
For  in  this  case 

(zi  -  23) (22  -  24) 
am  (zi  -  24) (22  -  23) 

must  have  one  of  the  three  values  —7 r,  0,  7 r,  so  that  am{(2i  —  23)  f(z\  —  24)}  and 
am{(22  —  23)/ (22  —  24)}  must  either  be  equal  or  differ  by  7r  (cf.  Ex.  19). 

If  (21,  22;  23,  24)  =  —1,  we  have  the  two  equations 

z\  -  23  _  22  -  23 
21  -  24  22  —  24 

The  four  points  A\ ,  A2,  A3,  A4  lie  on  a  circle,  A\  and  A 2  being  separated 
by  A3  and  A4.  Also  A1A3/A1A4  =  A2A3/A2A4.  Let  O  be  the  middle  point 
of  A3A4.  The  equation 

(zi  -  23)  (22  -  24)  _ 

(21  -  24)  (22  -  23) 

may  be  put  in  the  form 

(2l  +  22)  (23  +  24)  =  2(2122  +  2324), 
or,  what  is  the  same  thing, 

{21  -  2 (23  T  24)}{22  -  3(23  +  24)}  =  {3(23  -  24 ) } 2 . 


21-23  22  -  23 

am -  =  ±7r  +  am - , 

21  —  24  22  -  24 
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But  this  is  equivalent  to  OA\  •  OA2  =  OA3  =  OA 4  .  Hence  OA\  and  OA2  make 
equal  angles  with  A3A4,  and  OA\  ■  OA 2  =  OA2  =  OA2.  It  will  be  observed  that 
the  relation  between  the  pairs  A±,  A2  and  A3,  A4  is  symmetrical.  Hence,  if  O'  is 
the  middle  point  of  A4A2,  O' A3  and  O' A4  are  equally  inclined  to  A1A2,  and 
O' Az  ■  0'A4  =  O'  A2  =  O'Al 

26.  If  the  points  A±,  A2  are  given  by  az2  +  2bz  +  c  =  0,  and  the  points  A3,  A4 
by  a'z2  +  2b'z+d  =  0,  and  O  is  the  middle  point  of  A3A4,  and  ad +a7 c—2bb'  =  0, 
then  OAi,  OA2  are  equally  inclined  to  A3A4  and  OA\  ■  OA2  =  OA 2  =  OA2. 

(Math.  Trip.  1901.) 

27.  AB,  CD  are  two  intersecting  lines  in  Argand’s  diagram,  and  P,  Q  their 
middle  points.  Prove  that,  if  AB  bisects  the  angle  CPD  and  PA2  =  PB 2  = 
PC  ■  PD,  then  CD  bisects  the  angle  AQB  and  QC 2  =  QD 2  =  QA  •  QB. 

(Math.  Trip.  1909.) 

28.  The  condition  that  four  points  should  lie  on  a  circle.  A  sufficient 
condition  is  that  one  (and  therefore  all)  of  the  cross  ratios  should  be  real  (Ex.  25); 
this  condition  is  also  necessary.  Another  form  of  the  condition  is  that  it  should 
be  possible  to  choose  real  numbers  a,  (3,  7  such  that 

1  1  1 

a  (3  7=0. 

ZiZ4  +  Z2Z3  2224  +  Z3ZI  Z3Z4  +  ZlZ2 

[To  prove  this  we  observe  that  the  transformation  Z  =  1  /(z  —  Z4)  is 
equivalent  to  an  inversion  with  respect  to  the  point  Z4,  coupled  with  a  certain 
reflexion  (Ex.  21).  If  z\,  Z2,  Z3  lie  on  a  circle  through  Z4,  the  corresponding 
points  Z\  =  1/(21  —  Z4),  Z2  =  1/(^2  —  Z4),  Z3  =  1/(23  —  Z4)  lie  on  a  straight  line. 
Hence  (Ex.  12)  we  can  find  real  numbers  a! ,  (3' ,  y7  such  that  a'  +  f3'  +  y7  =  0 
and  a7/ (21  —  24)  +  (3' / (z 2  —  24)  +  y7/ (23  —  24)  =  0,  and  it  is  easy  to  prove  that 
this  is  equivalent  to  the  given  condition.] 

29.  Prove  the  following  analogue  of  De  Moivre’s  Theorem  for  real  numbers: 
if  />!,  i/>2,  ^>3,  . . .  is  a  series  of  positive  acute  angles  such  that 

tan  d'lTi- ri  —  tan  dm  sec  7 1  I  sec  dni  tan  7 1 , 

tan  dm-\-n  —  tan  dm  see  dn  T  sec  dm  tan  dni 
Sec  dm+n  —  Sec  dm  Sec  dn  T  tan  dm  tan  dni 


then 
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and 

tan  (pm  +  sec  4>m  =  (tan  (f>\  +  sec  c t>i)m ■ 

[Use  the  method  of  mathematical  induction.] 

30.  The  transformation  z  =  Zm .  In  this  case  r  =  Rm ,  and  9  and  m© 
differ  by  a  multiple  of  27 r.  If  Z  describes  a  circle  round  the  origin  then  z  describes 
a  circle  round  the  origin  m  times. 

The  whole  ( x ,  y )  plane  corresponds  to  any  one  of  m  sectors  in  the  (X,  Y) 
plane,  each  of  angle  2-k /m.  To  each  point  in  the  (x,  y )  plane  correspond  m  points 
in  the  (X,  Y )  plane. 

31.  Complex  functions  of  a  real  variable.  If  /(t),  (j)(t)  are  two  real 
functions  of  a  real  variable  t  defined  for  a  certain  range  of  values  of  t,  we  call 

z  =  f(t) +  i(f>(t)  (1) 

a  complex  function  of  t.  We  can  represent  it  graphically  by  drawing  the  curve 

x  =  f(t ),  y  =  <t>{ty, 


the  equation  of  the  curve  may  be  obtained  by  eliminating  t  between  these  equa¬ 
tions.  If  z  is  a  polynomial  in  t,  or  rational  function  of  t,  with  complex  coefficients, 
we  can  express  it  in  the  form  (1)  and  so  determine  the  curve  represented  by  the 
function. 

(i)  Let 

z  =  a  +  (b  —  a)t , 


where  a  and  b  are  complex  numbers.  If  a  =  a  +  a'i ,  b  =  (3  +  (3'i,  then 


x  =  a  +  ((3  —  a)t,  y  =  a  +  (/31  —  a)t. 


The  curve  is  the  straight  line  joining  the  points  z  =  a  and  z  =  b.  The  segment 
between  the  points  corresponds  to  the  range  of  values  of  t  from  0  to  1.  Find  the 
values  of  t  which  correspond  to  the  two  produced  segments  of  the  line. 

(ii)  If 


z  =  c  +  p 


1  +  ti 
1  —  ti 


where  p  is  positive,  then  the  curve  is  the  circle  of  centre  c  and  radius  p.  As 
t  varies  through  all  real  values  z  describes  the  circle  once. 


[Ill :  48] 


COMPLEX  NUMBERS 


118 


(iii)  In  general  the  equation  z  =  (a  +  bt)/(c  +  dt)  represents  a  circle.  This 
can  be  proved  by  calculating  x  and  y  and  eliminating:  but  this  process  is  rather 
cumbrous.  A  simpler  method  is  obtained  by  using  the  result  of  Ex.  22.  Let 
z  =  (a  +  bZ)/{c  +  dZ ),  Z  =  t.  As  t  varies  Z  describes  a  straight  line,  viz.  the 
axis  of  X.  Hence  z  describes  a  circle. 

(iv)  The  equation 

z  =  a  +  2bt  +  ct 2 

represents  a  parabola  generally,  a  straight  line  if  b/c  is  real. 

(v)  The  equation  z  =  (a  +  2 bt  +  ct2)/ ( a  +  2 fit  +  7f2),  where  a,  j3,  7  are 
real,  represents  a  conic  section. 

[Eliminate  t  from 

x  =  {A  +  2Bt  +  Ct2)/(a  +  2(3t  +  yt2),  y  =  {A1  +  2B’t  +  C’t2)/(a  +  2f3t  +  7f2), 
where  A  +  A'i  =  a,  B  +  B'i  =  6,  C  +  C'i  =  c.] 

47.  Roots  of  complex  numbers.  We  have  not,  up  to  the  present, 
attributed  any  meaning  to  symbols  such  as  {/a,  am^n,  when  a  is  a  complex 
number,  and  m  and  n  integers.  It  is,  however,  natural  to  adopt  the  def¬ 
initions  which  are  given  in  elementary  algebra  for  real  values  of  a.  Thus 
we  define  \fa  or  ci1/n,  where  n  is  a  positive  integer,  as  a  number  z  which 
satisfies  the  equation  zn  =  a;  and  am/n,  where  m  is  an  integer,  as  (a1/”)7”. 
These  definitions  do  not  prejudge  the  question  as  to  whether  there  are  or 
are  not  more  than  one  (or  any)  roots  of  the  equation. 

48.  Solution  of  the  equation  zn  =  a.  Let 

a  =  p(cos  0  +  i  sin  0) , 

where  p  is  positive  and  0  is  an  angle  such  that  —  n  <  0  ^  n.  If  we  put 
z  =  r(cos9  +  i sin#),  the  equation  takes  the  form 

rn (cos  n6  +  i  sin  n6)  =  p(cos  0  +  i  sin  0) ; 

so  that 

rn  =  p,  cosnO  —  cos0,  sin  nO  —  sin  0.  (1) 


[Ill :  48] 


COMPLEX  NUMBERS 


119 


The  only  possible  value  of  r  is  ^/p,  the  ordinary  arithmetical  nth  root 
of  p;  and  in  order  that  the  last  two  equations  should  be  satisfied  it  is 
necessary  and  sufficient  that  n6  =  (f>  +  2kn,  where  k  is  an  integer,  or 

6  =  (0  +  2kn)/n. 

If  k  —  pn  +  q,  where  p  and  q  are  integers,  and  0  ^  q  <  n,  the  value  of  6 
is  2pir  +  (0  +  2qn) /n,  and  in  this  the  value  of  p  is  a  matter  of  indifference. 
Hence  the  equation 

zn  =  a  =  p(cos  4>  +  i  sin  0) 

has  n  roots  and  n  only,  given  by  z  =  r(cos6)  +  isin0),  where 

r  =  \Tp,  0  =  (0  +  2qn)/n,  (q  =  0,  1,  2,  . . . ,  n  -  1). 

That  these  n  roots  are  in  reality  all  distinct  is  easily  seen  by  plotting 
them  on  Argand’s  diagram.  The  particular  root 

yp{cos(0/n)  +  i  sin(0/n)} 

is  called  the  principal  value  of  ^ a . 

The  case  in  which  o  =  l,p  =  l,0  =  Oisof  particular  interest.  The 
n  roots  of  the  equation  xn  =  1  are 

cos(2g7r/n)  +  i  sm(2qn/n),  (q  =  0,  1,  . . . ,  n  —  1). 

These  numbers  are  called  the  nth  roots  of  unity;  the  principal  value  is  unity 
itself.  If  we  write  c on  for  cos(27r/n)  +  i  sin(27r/n),  we  see  that  the  nth  roots 
of  unity  are 

1,  unf  u2n,  ...,  u”'1. 

Examples  XXII.  1.  The  two  square  roots  of  1  are  1,  —1;  the  three 
cube  roots  are  1,  ^(— 1  +  i\J 3),  |(— 1  —  i\/3);  the  four  fourth  roots  are  1,  i ,  — 1, 
— i;  and  the  five  fifth  roots  are 


Vh  —  1  +  i\ 

/l0  +  2\/5 

l 

’  4 

—  \/5  —  1  +  i  \ 

/l0  —  2\/5 

—  y/5—  1  —  i  \ 

/l0-2v/5 

1 

’  4 

V5  —  1  —  i\ 

/ 10  +  2V5 
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2.  Prove  that 

1  +  uin  +  +  •  •  •  +  w”  =  0. 

3.  Prove  that 

(x  +  yus  +  zu|)(x  +  yu: f  +  zuj3)  =  x2  +  y2  +  z2  -  yz  —  zx  —  xy. 

4.  The  nth  roots  of  a  are  the  products  of  the  nth  roots  of  unity  by  the 
principal  value  of  y/a. 

5.  It  follows  from  Exs.  xxi.  14  that  the  roots  of 

z2  =  a  +  f3i 


are 

±  \J ^{\/a2~+  (32  +  a }  ±  i \J j32  -  a}, 

like  or  unlike  signs  being  chosen  according  as  ^  is  positive  or  negative.  Show 
that  this  result  agrees  with  the  result  of  §  48. 

6.  Show  that  ( x2m  —  a2m)/(x2  —  a2)  is  equal  to 


7 r 


m 


2v r 


x  —  2  ax  cos - I -alia:—  2  ax  cos - ha 


m 


—  2 ax  cos 


(m  —  1)tt  9 
- - —  +a2 


m 


[The  factors  of  x2m  —  a2m  are 

(x  —  a),  (x  —  auj2m),  (x-aw2m),  ...  (x-aw^-1)- 

The  factor  x  —  is  x  +  a.  The  factors  (x  —  aw|m),  (x  —  au2^~s)  taken 

together  give  a  factor  x2  —  2ax cos(s7r/m)  +  a2.] 

7.  Resolve  x2m+1  —  a2m+1,  x2m  +  a2m,  and  x2m+1  +  a2m+1  into  factors  in 
a  similar  way. 

8.  Show  that  x2n  —  2 xnan  cos  9  +  a2n  is  equal  to 


2  (7  o 

x  —  2  xa  cos  — ha" 
n 


2  o  9  2tt  2 

x  —  Zxa  cos - h  a 

n 

/  2  o  0  +  2(n  -  1)7T  2\ 

...  I  x  —  2xa  cos - h  a  J 
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[Use  the  formula 

x2n  —  2 xnan  cos  9  +  a2n  =  {xn  —  an( cos  9  +  i  sin  9)}{xn  —  an( cos  9  —  i  sin  0)}, 

and  split  up  each  of  the  last  two  expressions  into  n  factors.] 

9.  Find  all  the  roots  of  the  equation  x6  —  2x3  +  2  =  0.  (Math.  Trip.  1910.) 

10.  The  problem  of  finding  the  accurate  value  of  un  in  a  numerical  form 
involving  square  roots  only,  as  in  the  formula  oj 3  =  |(— l+i\/3),  is  the  algebraical 
equivalent  of  the  geometrical  problem  of  inscribing  a  regular  polygon  of  n  sides 
in  a  circle  of  unit  radius  by  Euclidean  methods,  i.e.  by  ruler  and  compasses.  For 
this  construction  will  be  possible  if  and  only  if  we  can  construct  lengths  measured 
by  cos(27r/n)  and  sin(27r/n);  and  this  is  possible  (Ch.  II,  Misc.  Exs.  22)  if  and 
only  if  these  numbers  are  expressible  in  a  form  involving  square  roots  only. 

Euclid  gives  constructions  for  n  =  3,  4,  5,  6,  8,  10,  12,  and  15.  It  is  evident 
that  the  construction  is  possible  for  any  value  of  n  which  can  be  found  from 
these  by  multiplication  by  any  power  of  2.  There  are  other  special  values  of  n 
for  which  such  constructions  are  possible,  the  most  interesting  being  n  =  17. 

49.  The  general  form  of  De  Moivre’s  Theorem.  It  follows  from 

the  results  of  the  last  section  that  if  q  is  a  positive  integer  then  one  of  the 
values  of  (cos 9  +  i sin O  f  hi  is 

cos (9/q)  +  i  sin (9/q). 

Raising  each  of  these  expressions  to  the  power  p  (where  p  is  any  integer 
positive  or  negative),  we  obtain  the  theorem  that  one  of  the  values  of 
(cos 6  +  isinQ)p/q  is  cos (pO/q)  +  isin(p9/q),  or  that  if  a  is  any  rational 
number  then  one  of  the  values  of  (cos 6  +  ism6)a  is 

cos  ad  +  i  sin  a9. 

This  is  a  generalised  form  of  De  Moivre’s  Theorem  (§  45). 

MISCELLANEOUS  EXAMPLES  ON  CHAPTER  III. 

1.  The  condition  that  a  triangle  (xyz)  should  be  equilateral  is  that 
x  +  y  +  z  —  yz  —  zx  —  xy  =  U. 
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[Let  XY  Z  be  the  triangle.  The  displacement  ZX  is  Y Z  turned  through 
an  angle  r  in  the  positive  or  negative  direction.  Since  Cis  |vr  =  ix3, 
Cis(—  |7r)  =  I/W3  =  w|.  we  have  x  —  z  =  (z  —  y)u> 3  or  x  —  z  =  (z  —  y)ui Hence 
x  +  yu 3  +  zuj2  =  0  or  x  +  yco2  +  zui3  =  0.  The  result  follows  from  Exs.  XXII.  3.] 

2.  If  XYZ,  X'Y'Z'  are  two  triangles,  and 

YZ  •  Y’Z’  =  ~ZX  ■  Z’X’  =  XY  ■  X'Y', 

then  both  triangles  are  equilateral.  [From  the  equations 

( V  -  z)(y'  -  z')  =  (z-  x){z'  -  x')  =  (x  -  y)(x'  -  y')  =  k2, 

say,  we  deduce  ^  l/(y'  —  z')  =  0,  or  ^  x'2  —  1/ z'  =  0.  Now  apply  the  result 

of  the  last  example.] 

3.  Similar  triangles  BCX,  CAY ,  ABZ  are  described  on  the  sides  of  a 
triangle  ABC.  Show  that  the  centres  of  gravity  of  ABC ,  XY Z  are  coincident. 

[We  have  (x  —  c)/(b  —  c)  =  (y  —  a)/(c  —  a)  =  (z  —  b)/(a  —  b)  =  A,  say.  Express 
+  y  +  z)  in  terms  of  a,  b,  c.] 

4.  If  X,  Y,  Z  are  points  on  the  sides  of  the  triangle  ABC,  such  that 

BX/XC  =  CY/YA  =  AZ/ZB  =  r , 

and  if  ABC,  XY Z  are  similar,  then  either  r  =  1  or  both  triangles  are  equilateral. 

5.  If  A,  B,  C,  D  are  four  points  in  a  plane,  then 

AD  ■  BC  ^  BD  ■  CA  +  CD  •  AB. 


[Let  z\,  Z2,  Z3,  Z4  be  the  complex  numbers  corresponding  to  A,  B,  C,  D. 
Then  we  have  identically 


(aq  -  x4)(x2  -  x3)  +  (x2  -  x4)(x3  -  xi)  +  (x3  -  x4)(xi  -  x2)  =  0. 


|(xi  -  x4)(x2  -  x3)|  =  |(x2  -  x4)(x3  -  xi)  +  (x3  -  X4)(X!  -  x2)\ 

^  \(x2  -  X4)(x3  -  xi)|  +  |(x3  -  x4)(xi  -  X2)\] 


Hence 
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6.  Deduce  Ptolemy’s  Theorem  concerning  cyclic  quadrilaterals  from  the 
fact  that  the  cross  ratios  of  four  concyclic  points  are  real.  [Use  the  same  identity 
as  in  the  last  example.] 

7.  If  z2  +  z'2  =  1,  then  the  points  z,  z!  are  ends  of  conjugate  diameters  of  an 
ellipse  whose  foci  are  the  points  1,  —1.  [If  CP,  CD  are  conjugate  semi-diameters 
of  an  ellipse  and  S,  H  its  foci,  then  CD  is  parallel  to  the  external  bisector  of 
the  angle  SPH,  and  SP  ■  HP  =  CD2.] 

8.  Prove  that  |a  +  b\ 2  +  |a  —  b\ 2  =  2{|a|2  +  |6|2}.  [This  is  the  analytical 
equivalent  of  the  geometrical  theorem  that,  if  M  is  the  middle  point  of  PQ ,  then 
OP2  +  OQ2  =  20  M2  +  2  A/P2.] 

9.  Deduce  from  Ex.  8  that 

|a  +  \J  o?  —  b2 1  +  |a  —  \J  a2  —  b2\  =  \a  +  b\  +  \a  —  b\. 

[If  a  +  V a2  —  b2  =  zi,  a  —  \J a2  —  b2  =  Z2,  we  have 

I  1 2  ,  |  1 2  1|  ,  1 2  ,  li  1 2  m  1 2  ,  ol  2  j2i 

Di|  T  1 22 1  —  21D  P  z2\  +  2U1  —  ^2!  —  2|o|  +  2|a  —  0  |, 

and  so 

d^i]  +  1 2^2 1 ) 2  =  2{|o|“  +  |  a2  —  b2 1  +  |6|2}  =  |a  +  b\2  +  |a  —  b\ 2  +  2|cU  —  b2\. 

Another  way  of  stating  the  result  is:  if  z  1  and  Z2  are  the  roots  of 
az2  +  2/3z  +  7  =  0,  then 

M  +  |^|  =  (l/l«l){(l  -p  +  V&y\)  +  (1-/9-  v/5r|)}.] 

10.  Show  that  the  necessary  and  sufficient  conditions  that  both  the  roots  of 
the  equation  z2  +  az  +  b  =  0  should  be  of  unit  modulus  are 

|a|  ^  2,  |6|  =  1,  am6  =  2ama. 

[The  amplitudes  have  not  necessarily  their  principal  values.] 

11.  If  x4  +  4aix3  +  6a2X2  +  4a3X  +  a4  =  0  is  an  equation  with  real  coefficients 
and  has  two  real  and  two  complex  roots,  concyclic  in  the  Argand  diagram,  then 


2  2  3 

CL 2  0>2  —  ^2^4  —  2(21(22^3  — —  0* 
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12.  The  four  roots  of  aox4+4aix3+6a2.x2+4a3X+a4  =  0  will  be  harmonically 
related  if 

2  2  3 

CL  0Q3  +  (X4CI4  +  CL2  —  Q-0O,2fl4  —  2010203  =  0. 

[Express  ^23,14^31,24^12,34,  where  Z23ii4  =  (zi-Z2)(zs-Z4)  +  (z1-z3)(z2-Z4) 
and  z  1,  Z2,  23,  Z4  are  the  roots  of  the  equation,  in  terms  of  the  coefficients.] 

13.  Imaginary  points  and  straight  lines.  Let  ax  +  by  +  c  =  0  be  an 
equation  with  complex  coefficients  (which  of  course  may  be  real  in  special  cases) . 

If  we  give  x  any  particular  real  or  complex  value,  we  can  find  the  correspond¬ 
ing  value  of  y.  The  aggregate  of  pairs  of  real  or  complex  values  of  x  and  y  which 
satisfy  the  equation  is  called  an  imaginary  straight  line ;  the  pairs  of  values  are 
called  imaginary  points ,  and  are  said  to  lie  on  the  line.  The  values  of  x  and  y 
are  called  the  coordinates  of  the  point  (x,  y).  When  x  and  y  are  real,  the  point  is 
called  a  real  point:  when  a,  b,  c  are  all  real  (or  can  be  made  all  real  by  division 
by  a  common  factor),  the  line  is  called  a  real  line.  The  points  x  =  a  +  j3i, 
y  =  7  +  5i  and  x  =  a  —  /3i,  y  =  7  —  5i  are  said  to  be  conjugate ;  and  so  are  the 
lines 

(. A  +  A'i)x  +  (B  +  B'i)y  +  C  +  C'i  =  0,  {A  —  A'i)x  +  (B  —  Bli)y  +  C  —  C'i  =  0. 

Verify  the  following  assertions: — every  real  line  contains  infinitely  many  pairs 
of  conjugate  imaginary  points;  an  imaginary  line  in  general  contains  one  and  only 
one  real  point;  an  imaginary  line  cannot  contain  a  pair  of  conjugate  imaginary 
points: — and  find  the  conditions  (a)  that  the  line  joining  two  given  imaginary 
points  should  be  real,  and  (6)  that  the  point  of  intersection  of  two  imaginary 
lines  should  be  real. 

14.  Prove  the  identities 

(x  +  y  +  z)(x  +  yoj 3  +  zw|)(x  +  yw\  +  zoj 3)  =  x3  +  y3  +  z3  -  3xyz, 

(x  +  y  +  z)(x  +  yu5  +  zu}£)(x  +  yu\  +  zu$)(x  +  ywf  +  zcu%)(x  +  yu$  +  zu5) 

=  x5  +  y5  +  z5  —  5x3yz  +  5  xy2z2. 

15.  Solve  the  equations 

x3  —  3  ax  +  (a3  +  1)  =  0,  x5  —  5  ax3  +  5  a2x  +  (a5  +  1)  =  0. 

16.  If  f(x)  =  ao  +  a\x  +  •  •  •  +  akXk,  then 

{f(x)  +  f{ux)  -\ - b  f(<jJn~lx)}/n  =  a0  +  anxn  +  a2nx2n  4 - b  aXnxXn, 
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co  being  any  root  of  xn  =  1  (except  x  =  1),  and  An  the  greatest  multiple  of  n 
contained  in  k.  Find  a  similar  formula  for  +  a^+nxn  +  aM+ 2nx2n  +  . . . . 

17.  If 

(1  +  x)n  =  Po+  PlX  +  P2X2  +  .  .  .  , 


n  being  a  positive  integer,  then 


Po  ~  P2  +  P4 - 


Pi- P3+P5 - 


18.  Sum  the  series 

X  X2  X^  2»h/3 

2!(n  —  2)!  +  5!(n  —  5)!  +  8!(n-8)!  +  +  (n  _  i)P 

n  being  a  multiple  of  3.  (Math.  Trip.  1899.) 

19.  If  t  is  a  complex  number  such  that  \t\  =  1,  then  the  point 
x  =  ( at  +  b)/(t  —  c )  describes  a  circle  as  t  varies,  unless  |c|  =  1,  when  it 
describes  a  straight  line. 

20.  If  t.  varies  as  in  the  last  example  then  the  point  x  =  at  +  (b/t)}  in 
general  describes  an  ellipse  whose  foci  are  given  by  x2  =  ab,  and  whose  axes  are 
|a|  +  \b\  and  |a|  —  \b\.  But  if  |a|  =  \b\  then  x  describes  the  finite  straight  line 
joining  the  points  —  yfab ,  yfab. 

21.  Prove  that  if  t  is  real  and  z  =  t2  —  1  +  \J f  4  —  t2.  then,  when  t2  <  1, 

2  is  represented  by  a  point  which  lies  on  the  circle  x2  +  y2  +  x  =  0.  Assuming 
that,  when  t2  >  1,  y/t4  —  t2  denotes  the  positive  square  root  of  t4  —  f2,  discuss 
the  motion  of  the  point  which  represents  z,  as  t  diminishes  from  a  large  positive 
value  to  a  large  negative  value.  (Math.  Trip.  1912.) 

22.  The  coefficients  of  the  transformation  2  =  (aZ  +  b ) /(cZ  +  d )  are  subject 
to  the  condition  ad  —  bc=  1.  Show  that,  if  c  /  0,  there  are  two  fixed  points  a,  fi, 
i.e.  points  unaltered  by  the  transformation,  except  when  (a  +  d)2  =  4,  when 
there  is  only  one  fixed  point  a;  and  that  in  these  two  cases  the  transformation 
may  be  expressed  in  the  forms 

z  —  a  r.Z  —  a  1  1 

- 2  =  K - - = - 1 -K. 

z  —  f5  Z  —  fi  z  —  a  Z  —  a 

Show  further  that,  if  c  =  0,  there  will  be  one  fixed  point  a  unless  a  =  d,  and 
that  in  these  two  cases  the  transformation  may  be  expressed  in  the  forms 


2  —  a  =  K(Z  —  a),  z  =  Z  +  K. 
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Finally,  if  a,  b,  c,  d  are  further  restricted  to  positive  integral  values  (including 
zero),  show  that  the  only  transformations  with  less  than  two  fixed  points  are  of 
the  forms  (1  / z)  =  (1  /Z)  +  K,  z  =  Z  +  K.  (Math.  Trip.  1911.) 

23.  Prove  that  the  relation  z  =  (1  +  Zi)/(Z  +  i)  transforms  the  part  of  the 

axis  of  x  between  the  points  z  =  1  and  z  =  —1  into  a  semicircle  passing  through 
the  points  Z  =  1  and  Z  =  —  1.  Find  all  the  figures  that  can  be  obtained  from 
the  originally  selected  part  of  the  axis  of  x  by  successive  applications  of  the 
transformation.  (Math.  Trip.  1912.) 

24.  If  2  =  2 Z  +  Z 2  then  the  circle  |Z|  =  1  corresponds  to  a  cardioid  in  the 
plane  of  2. 

25.  Discuss  the  transformation  2  =  \{Z  +  (1/Z)},  showing  in  particular  that 
to  the  circles  X 2  +  Y 2  =  a 2  correspond  the  confocal  ellipses 


26.  If  (2  +  l)2  =  4 /Z  then  the  unit  circle  in  the  2-plane  corresponds  to  the 
parabola  R  cos2  ^0  =  1  in  the  Z-plane,  and  the  inside  of  the  circle  to  the  outside 
of  the  parabola. 

27.  Show  that,  by  means  of  the  transformation  2  =  {(Z  —  ci) / (Z  +  ci)}2 ,  the 
upper  half  of  the  2-plane  may  be  made  to  correspond  to  the  interior  of  a  certain 
semicircle  in  the  Z-plane. 

28.  If  2  =  Z2  —  1,  then  as  2  describes  the  circle  1 2|  =  k.  the  two  corresponding 
positions  of  Z  each  describe  the  Cassinian  oval  p\p^  =  k,  where  p\ ,  P2  are  the 
distances  of  Z  from  the  points  —1,  1.  Trace  the  ovals  for  different  values  of  k. 

29.  Consider  the  relation  az2  +  2 hzZ  +  bZ2  +  2 gz  +  2 fZ  +  c  =  0.  Show  that 
there  are  two  values  of  Z  for  which  the  corresponding  values  of  2  are  equal,  and 
vice  versa.  We  call  these  the  branch  points  in  the  Z  and  2-planes  respectively. 
Show  that,  if  2  describes  an  ellipse  whose  foci  are  the  branch  points,  then  so 
does  Z. 

[We  can,  without  loss  of  generality,  take  the  given  relation  in  the  form 

22  +  2 zZ  cos  oj  +  Z2  =  1  : 

the  reader  should  satisfy  himself  that  this  is  the  case.  The  branch  points  in 
either  plane  are  cosec  uj  and  —  cosec  cu.  An  ellipse  of  the  form  specified  is  given 
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|  z  +  cosec  uj\  +  \z  —  cosec  w|  =  C, 
where  C  is  a  constant.  This  is  equivalent  (Ex.  9)  to 


\z  +  v  z2  —  cosec2  oj\  +  \z  —  y  z2  —  cosec2  u\  = 


Express  this  in  terms  of  Z .] 

30.  If  z  =  aZ'm  +  bZn,  where  m,  n  are  positive  integers  and  a,  b  real,  then 
as  Z  describes  the  unit  circle,  z  describes  a  hypo-  or  epi-cycloid. 

31.  Show  that  the  transformation 

(u  T  di)Zo  T  b 
cZq  —  (a  —  di)  ’ 

where  a,  b ,  c,  d  are  real  and  a2  +  d?  +  be  >  0,  and  Zq  denotes  the  conjugate  of  Z, 
is  equivalent  to  an  inversion  with  respect  to  the  circle 

c(x2  +  y2)  —  2  ax  —  2  dy  —  6  =  0. 


What  is  the  geometrical  interpretation  of  the  transformation  when 

a2  +  d2  +  be  <  0? 


where  c  is  rational  and  0  <  c  <  1 ,  transforms  the  circle 
of  a  circular  lune  of  angle  n/c. 


1  into  the  boundary 


CHAPTER  IV 


LIMITS  OF  FUNCTIONS  OF  A  POSITIVE  INTEGRAL  VARIABLE 

50.  Functions  of  a  positive  integral  variable.  In  Chapter  II 
we  discussed  the  notion  of  a  function  of  a  real  variable  x,  and  illustrated 
the  discussion  by  a  large  number  of  examples  of  such  functions.  And  the 
reader  will  remember  that  there  was  one  important  particular  with  regard 
to  which  the  functions  which  we  took  as  illustrations  differed  very  widely. 
Some  were  defined  for  all  values  of  x,  some  for  rational  values  only,  some 
for  integral  values  only,  and  so  on. 

Consider,  for  example,  the  following  functions:  (i)  x,  (ii)  yT,  (iii)  the  de¬ 
nominator  of  x,  (iv)  the  square  root  of  the  product  of  the  numerator  and  the 
denominator  of  x,  (v)  the  largest  prime  factor  of  x,  (vi)  the  product  of  yT 
and  the  largest  prime  factor  of  x,  (vii)  the  xth  prime  number,  (viii)  the  height 
measured  in  inches  of  convict  x  in  Dartmoor  prison. 

Then  the  aggregates  of  values  of  x  for  which  these  functions  are  defined  or, 
as  we  may  say,  the  fields  of  definition  of  the  functions,  consist  of  (i)  all  values 
of  x ,  (ii)  all  positive  values  of  x,  (iii)  all  rational  values  of  x,  (iv)  all  positive 
rational  values  of  x,  (v)  all  integral  values  of  x.  (vi),  (vii)  all  positive  integral 
values  of  x.  (viii)  a  certain  number  of  positive  integral  values  of  x,  viz.,  1,  2, 

. . . ,  N,  where  N  is  the  total  number  of  convicts  at  Dartmoor  at  a  given  moment 
of  time.* 

Now  let  us  consider  a  function,  such  as  (vii)  above,  which  is  defined 
for  all  positive  integral  values  of  x  and  no  others.  This  function  may  be 
regarded  from  two  slightly  different  points  of  view.  We  may  consider  it, 
as  has  so  far  been  our  custom,  as  a  function  of  the  real  variable  x  defined 
for  some  only  of  the  values  of  x,  viz.  positive  integral  values,  and  say  that 
for  all  other  values  of  x  the  definition  fails.  Or  we  may  leave  values  of  x 

*In  the  last  case  N  depends  on  the  time,  and  convict  x,  where  x  has  a  definite 
value,  is  a  different  individual  at  different  moments  of  time.  Thus  if  we  take  different 
moments  of  time  into  consideration  we  have  a  simple  example  of  a  function  y  =  F(x,  t ) 
of  two  variables,  defined  for  a  certain  range  of  values  of  t ,  viz.  from  the  time  of  the 
establishment  of  Dartmoor  prison  to  the  time  of  its  abandonment,  and  for  a  certain 
number  of  positive  integral  values  of  x,  this  number  varying  with  t. 
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other  than  positive  integral  values  entirely  out  of  account,  and  regard  our 
function  as  a  function  of  the  positive  integral  variable  n,  whose  values  are 
the  positive  integers 

1,2,  3,  4,  .... 

In  this  case  we  may  write 

V  =  <t>{n) 

and  regard  y  now  as  a  function  of  n  defined  for  all  values  of  n. 

It  is  obvious  that  any  function  of  x  defined  for  all  values  of  x  gives  rise 
to  a  function  of  n  defined  for  all  values  of  n.  Thus  from  the  function  y  =  x2 
we  deduce  the  function  y  =  n 2  by  merely  omitting  from  consideration  all 
values  of  x  other  than  positive  integers,  and  the  corresponding  values  of  y. 
On  the  other  hand  from  any  function  of  n  we  can  deduce  any  number  of 
functions  of  x  by  merely  assigning  values  to  y,  corresponding  to  values  of  x 
other  than  positive  integral  values,  in  any  way  we  please. 

51.  Interpolation.  The  problem  of  determining  a  function  of  x  which 
shall  assume,  for  all  positive  integral  values  of  x,  values  agreeing  with  those  of  a 
given  function  of  n,  is  of  extreme  importance  in  higher  mathematics.  It  is  called 
the  problem  of  functional  interpolation. 

Were  the  problem  however  merely  that  of  finding  some  function  of  x  to  fulfil 
the  condition  stated,  it  would  of  course  present  no  difficulty  whatever.  We  could, 
as  explained  above,  simply  fill  in  the  missing  values  as  we  pleased:  we  might 
indeed  simply  regard  the  given  values  of  the  function  of  n  as  all  the  values  of 
the  function  of  x  and  say  that  the  definition  of  the  latter  function  failed  for  all 
other  values  of  x.  But  such  purely  theoretical  solutions  are  obviously  not  what 
is  usually  wanted.  What  is  usually  wanted  is  some  formula  involving  x  (of  as 
simple  a  kind  as  possible)  which  assumes  the  given  values  for  x  =  1 .  2,  .... 

In  some  cases,  especially  when  the  function  of  n  is  itself  defined  by  a  formula, 
there  is  an  obvious  solution.  If  for  example  y  =  cj>(n),  where  4>(n)  is  a  function 
of  n,  such  as  n2  or  cosri7r,  which  would  have  a  meaning  even  were  n  not  a 
positive  integer,  we  naturally  take  our  function  of  x  to  be  y  =  4>(x).  But  even 
in  this  very  simple  case  it  is  easy  to  write  down  other  almost  equally  obvious 
solutions  of  the  problem.  For  example 


y  =  4>{x)  +  sin.T7r 
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assumes  the  value  </>(n)  for  x  =  n,  since  sinn7r  =  0. 

In  other  cases  (j>(n)  may  be  defined  by  a  formula,  such  as  (— l)n,  which  ceases 
to  define  for  some  values  of  x  (as  here  in  the  case  of  fractional  values  of  x  with 
even  denominators,  or  irrational  values).  But  it  may  be  possible  to  transform 
the  formula  in  such  a  way  that  it  does  define  for  all  values  of  x.  In  this  case,  for 
example, 

(— l)n  =  cos  wr, 

if  n  is  an  integer,  and  the  problem  of  interpolation  is  solved  by  the  func¬ 
tion  COS  XTT. 

In  other  cases  <f>(x)  may  be  defined  for  some  values  of  x  other  than  positive 
integers,  but  not  for  all.  Thus  from  y  =  n11  we  are  led  to  y  =  xx.  This 
expression  has  a  meaning  for  some  only  of  the  remaining  values  of  x.  If  for 
simplicity  we  confine  ourselves  to  positive  values  of  x,  then  xx  has  a  meaning  for 
all  rational  values  of  x,  in  virtue  of  the  definitions  of  fractional  powers  adopted 
in  elementary  algebra.  But  when  x  is  irrational  xx  has  (so  far  as  we  are  in  a 
position  to  say  at  the  present  moment)  no  meaning  at  all.  Thus  in  this  case  the 
problem  of  interpolation  at  once  leads  us  to  consider  the  question  of  extending 
our  definitions  in  such  a  way  that  xx  shall  have  a  meaning  even  when  x  is 
irrational.  We  shall  see  later  on  how  the  desired  extension  may  be  effected. 

Again,  consider  the  case  in  which 

y  =  1  •  2  . . .  n  =  n\. 

In  this  case  there  is  no  obvious  formula  in  x  which  reduces  to  nl  for  x  =  n,  as 
x\  means  nothing  for  values  of  x  other  than  the  positive  integers.  This  is  a  case 
in  which  attempts  to  solve  the  problem  of  interpolation  have  led  to  important 
advances  in  mathematics.  For  mathematicians  have  succeeded  in  discovering  a 
function  (the  Gamma-function)  which  possesses  the  desired  property  and  many 
other  interesting  and  important  properties  besides. 

52.  Finite  and  infinite  classes.  Before  we  proceed  further  it  is 

necessary  to  make  a  few  remarks  about  certain  ideas  of  an  abstract  and 
logical  nature  which  are  of  constant  occurrence  in  Pure  Mathematics. 

In  the  first  place,  the  reader  is  probably  familiar  with  the  notion  of  a 
class.  It  is  unnecessary  to  discuss  here  any  logical  difficulties  which  may 
be  involved  in  the  notion  of  a  ‘class’:  roughly  speaking  we  may  say  that 
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a  class  is  the  aggregate  or  collection  of  all  the  entities  or  objects  which 
possess  a  certain  property,  simple  or  complex.  Thus  we  have  the  class  of 
British  subjects,  or  members  of  Parliament,  or  positive  integers,  or  real 
numbers. 

Moreover,  the  reader  has  probably  an  idea  of  what  is  meant  by  a  finite 
or  infinite  class.  Thus  the  class  of  British  subjects  is  a  finite  class:  the 
aggregate  of  all  British  subjects,  past,  present,  and  future,  has  a  finite 
number  n,  though  of  course  we  cannot  tell  at  present  the  actual  value  of  n. 
The  class  of  present  British  subjects,  on  the  other  hand,  has  a  number  n 
which  could  be  ascertained  by  counting,  were  the  methods  of  the  census 
effective  enough. 

On  the  other  hand  the  class  of  positive  integers  is  not  finite  but  infinite. 
This  may  be  expressed  more  precisely  as  follows.  If  n  is  any  positive 
integer,  such  as  1000,  1,000,000  or  any  number  we  like  to  think  of,  then 
there  are  more  than  n  positive  integers.  Thus,  if  the  number  we  think 
of  is  1,000,000,  there  are  obviously  at  least  1,000,001  positive  integers. 
Similarly  the  class  of  rational  numbers,  or  of  real  numbers,  is  infinite.  It  is 
convenient  to  express  this  by  saying  that  there  are  an  infinite  number 
of  positive  integers,  or  rational  numbers,  or  real  numbers.  But  the  reader 
must  be  careful  always  to  remember  that  by  saying  this  we  mean  simply 
that  the  class  in  question  has  not  a  finite  number  of  members  such  as  1000 
or  1,000,000. 

53.  Properties  possessed  by  a  function  of  n  for  large  values 

of  n.  We  may  now  return  to  the  ‘functions  of  n’  which  we  were  discussing 
in  §§  50-51.  They  have  many  points  of  difference  from  the  functions  of  x 
which  we  discussed  in  Chap.  II.  But  there  is  one  fundamental  character¬ 
istic  which  the  two  classes  of  functions  have  in  common:  the  values  of  the 
variable  for  which  they  are  defined  form  an  infinite  class.  It  is  this  fact 
which  forms  the  basis  of  all  the  considerations  which  follow  and  which,  as 
we  shall  see  in  the  next  chapter,  apply,  mutatis  mutandis,  to  functions  of  x 
as  well. 

Suppose  that  0(n)  is  any  function  of  n,  and  that  P  is  any  property 
which  4>(n)  may  or  may  not  have,  such  as  that  of  being  a  positive  integer 
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or  of  being  greater  than  1.  Consider,  for  each  of  the  values  n  =  1,  2, 
3,  . . . ,  whether  has  the  property  P  or  not.  Then  there  are  three 
possibilities: — 

(a)  (f>(n)  may  have  the  property  P  for  all  values  of  n,  or  for  all  values 
of  n  except  a  finite  number  N  of  such  values: 

( b )  (j){n)  may  have  the  property  for  no  values  of  n,  or  only  for  a  finite 
number  N  of  such  values: 

(c)  neither  (a)  nor  ( b )  may  be  true. 

If  ( b )  is  true,  the  values  of  n  for  which  (f>(n)  has  the  property  form 
a  finite  class.  If  (a)  is  true,  the  values  of  n  for  which  (f>(n)  has  not  the 
property  form  a  finite  class.  In  the  third  case  neither  class  is  finite.  Let  us 
consider  some  particular  cases. 

(1)  Let  <f>(n )  =  n,  and  let  P  be  the  property  of  being  a  positive  integer. 
Then  4>{n)  has  the  property  P  for  all  values  of  n. 

If  on  the  other  hand  P  denotes  the  property  of  being  a  positive  integer 
greater  than  or  equal  to  1000,  then  (f>(n)  has  the  property  for  all  values  of  n 
except  a  finite  number  of  values  of  n,  viz.  1,2,3,...,  999.  In  either  of  these 
cases  (a)  is  true. 

(2)  If  4>{n)  =  n,  and  P  is  the  property  of  being  less  than  1000,  then  ( b )  is 
true. 

(3)  If  4>(n)  =  n,  and  P  is  the  property  of  being  odd,  then  (c)  is  true.  For 
<j>(n)  is  odd  if  n  is  odd  and  even  if  n  is  even,  and  both  the  odd  and  the  even 
values  of  n  form  an  infinite  class. 

Example.  Consider,  in  each  of  the  following  cases,  whether  (a),  (6),  or  (c) 
is  true: 

(i)  4>(n)  =  n,  P  being  the  property  of  being  a  perfect  square, 

(ii)  4>(n)  =  pn,  where  pn  denotes  the  nth  prime  number,  P  being  the 

property  of  being  odd, 

(iii)  4>(n)  =  pn,  P  being  the  property  of  being  even, 

(iv)  4>(n)  =  pn,  P  being  the  property  4>{n)  >  n, 

(v)  4>(n)  =  1  —  (— l)n(l/n),  P  being  the  property  (j>(n)  <  1, 

(vi)  4>(n)  =  1  —  (— l)n(l /n),  P  being  the  property  4>{n)  <  2, 

(vii)  (j>{n)  =  1000{1  +  (— l)n}/n,  P  being  the  property  < f>(n)  <  1, 
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(viii)  4>(n)  =  1/n,  P  being  the  property  4>(n)  <  .001, 

(ix)  4>(n)  =  (— 1  )n/n,  P  being  the  property  \4>(n)\  <  .001, 

(x)  0(n)  =  10,000/n,  or  (— l)n10,000/n,  P  being  either  of  the  properties 

4>(n)  <  .001  or  \4>(n)\  <  .001, 

(xi)  4>(n)  =  (n  —  l)/(n  +  1),  P  being  the  property  1  —  f(n )  <  .0001. 

54.  Let  us  now  suppose  that  (pin)  and  P  are  such  that  the  asser¬ 
tion  (a)  is  true,  i.e.  that  (pin)  has  the  property  P,  if  not  for  all  values  of  n, 
at  any  rate  for  all  values  of  n  except  a  finite  number  N  of  such  values.  We 
may  denote  these  exceptional  values  by 

n i  ,  •  •  ■ , 

There  is  of  course  no  reason  why  these  N  values  should  be  the  first  N  values 
1,  2,  . . . ,  N,  though,  as  the  preceding  examples  show,  this  is  frequently  the 
case  in  practice.  But  whether  this  is  so  or  not  we  know  that  (j){n)  has  the 
property  P  if  n  >  n m-  Thus  the  nth  prime  is  odd  if  n  >  2,  n  =  2  being 
the  only  exception  to  the  statement;  and  1/n  <  .001  if  n  >  1000,  the  first 
1000  values  of  n  being  the  exceptions;  and 

1000(1  +  (-l)"}/n  <  1 

if  n  >  2000,  the  exceptional  values  being  2,  4,  6,  . . . ,  2000.  That  is  to  say, 
in  each  of  these  cases  the  property  is  possessed  for  all  values  of  n  from  a 
definite  value  onwards. 

We  shall  frequently  express  this  by  saying  that  d(n)  has  the  property 
for  large,  or  very  large,  or  all  sufficiently  large  values  of  n.  Thus  when 
we  say  that  d(n)  has  the  property  P  (which  will  as  a  rule  be  a  property 
expressed  by  some  relation  of  inequality)  for  large  values  of  n,  what  we 
mean  is  that  we  can  determine  some  definite  number,  no  say,  such  that 
fin)  has  the  property  for  all  values  of  n  greater  than  or  equal  to  no-  This 
number  no,  in  the  examples  considered  above,  may  be  taken  to  be  any 
number  greater  than  n tv,  the  greatest  of  the  exceptional  numbers:  it  is 
most  natural  to  take  it  to  be  hn  +  1. 
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Thus  we  may  say  that  ‘all  large  primes  are  odd’,  or  that  ‘1  jn  is  less 
than  .001  for  large  values  of  n\  And  the  reader  must  make  himself  familiar 
with  the  use  of  the  word  large  in  statements  of  this  kind.  Large  is  in 
fact  a  word  which,  standing  by  itself,  has  no  more  absolute  meaning  in 
mathematics  than  in  the  language  of  common  life.  It  is  a  truism  that  in 
common  life  a  number  which  is  large  in  one  connection  is  small  in  another; 
6  goals  is  a  large  score  in  a  football  match,  but  6  runs  is  not  a  large  score  in  a 
cricket  match;  and  400  runs  is  a  large  score,  but  £400  is  not  a  large  income: 
and  so  of  course  in  mathematics  large  generally  means  large  enough ,  and 
what  is  large  enough  for  one  purpose  may  not  be  large  enough  for  another. 

We  know  now  what  is  meant  by  the  assertion  l(j){n )  has  the  property  P 
for  large  values  of  n\  It  is  with  assertions  of  this  kind  that  we  shall  be 
concerned  throughout  this  chapter. 

55.  The  phrase  ‘n  tends  to  infinity’.  There  is  a  somewhat  dif¬ 
ferent  way  of  looking  at  the  matter  which  it  is  natural  to  adopt.  Suppose 
that  n  assumes  successively  the  values  1,  2,  3,  ... .  The  word  ‘successively’ 
naturally  suggests  succession  in  time,  and  we  may  suppose  n,  if  we  like,  to 
assume  these  values  at  successive  moments  of  time  ( e.g .  at  the  beginnings 
of  successive  seconds).  Then  as  the  seconds  pass  n  gets  larger  and  larger 
and  there  is  no  limit  to  the  extent  of  its  increase.  However  large  a  number 
we  may  think  of  (e.g.  2,147,483,647),  a  time  will  come  when  n  has  become 
larger  than  this  number. 

It  is  convenient  to  have  a  short  phrase  to  express  this  unending  growth 
of  n,  and  we  shall  say  that  n  tends  to  infinity,  or  n  — »  oo,  this  last  symbol 
being  usually  employed  as  an  abbreviation  for  ‘infinity’.  The  phrase  ‘tends 
to’  like  the  word  ‘successively’  naturally  suggests  the  idea  of  change  in 
time,  and  it  is  convenient  to  think  of  the  variation  of  n  as  accomplished 
in  time  in  the  manner  described  above.  This  however  is  a  mere  matter  of 
convenience.  The  variable  n  is  a  purely  logical  entity  which  has  in  itself 
nothing  to  do  with  time. 

The  reader  cannot  too  strongly  impress  upon  himself  that  when  we 
say  that  n  ‘tends  to  oo’  we  mean  simply  that  n  is  supposed  to  assume  a 
series  of  values  which  increase  continually  and  without  limit.  There  is  no 
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number  ‘infinity’:  such  an  equation  as 


n  —  og 


is  as  it  stands  absolutely  meaningless:  n  cannot  be  equal  to  oo,  because 
‘equal  to  oo’  means  nothing.  So  far  in  fact  the  symbol  oo  means  nothing 
at  all  except  in  the  one  phrase  ‘tends  to  oo’,  the  meaning  of  which  we  have 
explained  above.  Later  on  we  shall  learn  how  to  attach  a  meaning  to  other 
phrases  involving  the  symbol  oo,  but  the  reader  will  always  have  to  bear 
in  mind 

(1)  that  oo  by  itself  means  nothing,  although  phrases  containing  it 
sometimes  mean  something, 

(2)  that  in  every  case  in  which  a  phrase  containing  the  symbol  oo 
means  something  it  will  do  so  simply  because  we  have  previously  attached 
a  meaning  to  this  particular  phrase  by  means  of  a  special  definition. 

Now  it  is  clear  that  if  (j){n)  has  the  property  P  for  large  values  of  n, 
and  if  n  ‘tends  to  oo’,  in  the  sense  which  we  have  just  explained,  then 
n  will  ultimately  assume  values  large  enough  to  ensure  that  b(n)  has  the 
property  P.  And  so  another  way  of  putting  the  question  ‘what  properties 
has  (j){n)  for  sufficiently  large  values  of  n?’  is  ‘how  does  0(n)  behave  as 
n  tends  to  oo?’ 

56.  The  behaviour  of  a  function  of  n  as  n  tends  to  infinity. 

We  shall  now  proceed,  in  the  light  of  the  remarks  made  in  the  preceding 
sections,  to  consider  the  meaning  of  some  kinds  of  statements  which 
are  perpetually  occurring  in  higher  mathematics.  Let  us  consider,  for 
example,  the  two  following  statements:  (a)  1/n  is  small  for  large  values 
of  n,  ( b )  1  —  (1/n)  is  nearly  equal  to  1  for  large  values  of  n.  Obvious 
as  they  may  seem,  there  is  a  good  deal  in  them  which  will  repay  the 
reader’s  attention.  Let  us  take  (a)  first,  as  being  slightly  the  simpler. 

We  have  already  considered  the  statement  ‘1/n  is  less  than  .01  for  large 
values  of  n\  This,  we  saw,  means  that  the  inequality  1/n  <  .01  is  true  for 
all  values  of  n  greater  than  some  definite  value,  in  fact  greater  than  100. 
Similarly  it  is  true  that  ‘1/n  is  less  than  .0001  for  large  values  of  n’:  in 
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fact  1  /n  <  .0001  if  n  >  10,000.  And  instead  of  .01  or  .0001  we  might  take 
.000  001  or  .000  000  01,  or  indeed  any  positive  number  we  like. 

It  is  obviously  convenient  to  have  some  way  of  expressing  the  fact  that 
any  such  statement  as  ‘1/n  is  less  than  .01  for  large  values  of  n’  is  true, 
when  we  substitute  for  .01  any  smaller  number,  such  as  .0001  or  .000  001 
or  any  other  number  we  care  to  choose.  And  clearly  we  can  do  this  by 
saying  that  ‘ however  small  e  may  be  (provided  of  course  it  is  positive), 
then  1/n  <  e  for  sufficiently  large  values  of  n\  That  this  is  true  is  obvious. 
For  1/n  <  e  if  n  >  1/e,  so  that  our  ‘sufficiently  large’  values  of  n  need 
only  all  be  greater  than  1/e.  The  assertion  is  however  a  complex  one,  in 
that  it  really  stands  for  the  whole  class  of  assertions  which  we  obtain  by 
giving  to  e  special  values  such  as  .01.  And  of  course  the  smaller  e  is,  and 
the  larger  1/e,  the  larger  must  be  the  least  of  the  ‘sufficiently  large’  values 
of  n:  values  which  are  sufficiently  large  when  e  has  one  value  are  inadequate 
when  it  has  a  smaller. 

The  last  statement  italicised  is  what  is  really  meant  by  the  state¬ 
ment  (a),  that  1/n  is  small  when  n  is  large.  Similarly  ( b )  really  means 
“ if  0(n)  =  1  —  (1/n),  then  the  statement  1  —  0(n)  <  e  for  sufficiently 
large  values  of  n’  is  true  whatever  positive  value  ( such  as  .01  or  .0001)  we 
attribute  to  e”.  That  the  statement  ( b )  is  true  is  obvious  from  the  fact  that 
1  —  0(n)  =  1/n. 

There  is  another  way  in  which  it  is  common  to  state  the  facts  expressed 
by  the  assertions  (a)  and  ( b ).  This  is  suggested  at  once  by  §  55.  Instead  of 
saying  ‘1/n  is  small  for  large  values  of  n’  we  say  ‘1/n  tends  to  0  as  n  tends 
to  oo’.  Similarly  we  say  that  ‘1  —  (1/n)  tends  to  1  as  n  tends  to  oo’:  and 
these  statements  are  to  be  regarded  as  precisely  equivalent  to  (a)  and  ( b ). 
Thus  the  statements 

‘1/n  is  small  when  n  is  large’, 

‘1/n  tends  to  0  as  n  tends  to  oo’, 

are  equivalent  to  one  another  and  to  the  more  formal  statement 

‘if  e  is  any  positive  number,  however  small,  then  1/n  <  e 
for  sufficiently  large  values  of  n’, 
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or  to  the  still  more  formal  statement 

‘if  e  is  any  positive  number,  however  small,  then  we  can  find 
a  number  no  such  that  1  jn  <  e  for  all  values  of  n  greater  than 
or  equal  to  no’. 

The  number  no  which  occurs  in  the  last  statement  is  of  course  a  function 
of  e.  We  shall  sometimes  emphasize  this  fact  by  writing  no  in  the  form  no(e). 

The  reader  should  imagine  himself  confronted  by  an  opponent  who  questions 
the  truth  of  the  statement.  He  would  name  a  series  of  numbers  growing  smaller 
and  smaller.  He  might  begin  with  .001.  The  reader  would  reply  that  1/n  <  .001 
as  soon  as  n  >  1000.  The  opponent  would  be  bound  to  admit  this,  but  would 
try  again  with  some  smaller  number,  such  as  .000  000  1.  The  reader  would  reply 
that  1/n  <  .000  0001  as  soon  as  n  >  10,000,000:  and  so  on.  In  this  simple  case 
it  is  evident  that  the  reader  would  always  have  the  better  of  the  argument. 

We  shall  now  introduce  yet  another  way  of  expressing  this  property  of 
the  function  1/n.  We  shall  say  that  'the  limit  of  1/n  as  n  tends  to  oo 
is  O’,  a  statement  which  we  may  express  symbolically  in  the  form 

,  1 
lim  —  =  0, 

n— >oo  Tl 

or  simply  lim(l/n)  =  0.  We  shall  also  sometimes  write  ‘1/n  — y  0  as 
n  -A  oo’,  which  may  be  read  ‘1/n  tends  to  0  as  n  tends  to  oo’;  or  simply 
‘1/n  — y  O’.  In  the  same  way  we  shall  write 

lim  (l  —  —  )  =1,  lim  (  1  —  —  )  =1, 

ih  oo  \  n  J  \  n  J 


or  1  —  (1/n)  -A  1. 

57.  Now  let  us  consider  a  different  example:  let  0(n)  =  n2.  Then 
‘n2  is  large  when  n  is  large' .  This  statement  is  equivalent  to  the  more 
formal  statements 
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‘if  A  is  any  positive  number,  however  large,  then  n 2  >  A 
for  sufficiently  large  values  of  n\ 

‘we  can  find  a  number  no  (A)  such  that  n2  >  A  for  all  values 
of  n  greater  than  or  equal  to  n0(A)’. 

And  it  is  natural  in  this  case  to  say  that  ‘n2  tends  to  oo  as  n  tends  to  oo’, 
or  ‘n2  tends  to  oo  with  n\  and  to  write 

n2  — >  oo. 

Finally  consider  the  function  (f){n)  =  —  n2.  In  this  case  < j){n )  is  large, 
but  negative,  when  n  is  large,  and  we  naturally  say  that  ‘—  n2  tends  to  — oo 
as  n  tends  to  oo’  and  write 


— n2  — >■  — oo. 

And  the  use  of  the  symbol  —  oo  in  this  sense  suggests  that  it  will  sometimes 
be  convenient  to  write  n2  — »  +oo  for  n2  — >  oo  and  generally  to  use  +oo 
instead  of  oo,  in  order  to  secure  greater  uniformity  of  notation. 

But  we  must  once  more  repeat  that  in  all  these  statements  the  symbols 
oo,  +oo,  —  oo  mean  nothing  whatever  by  themselves,  and  only  acquire  a 
meaning  when  they  occur  in  certain  special  connections  in  virtue  of  the 
explanations  which  we  have  just  given. 

58.  Definition  of  a  limit.  After  the  discussion  which  precedes  the 
reader  should  be  in  a  position  to  appreciate  the  general  notion  of  a  limit. 
Roughly  we  may  say  that  (f>(n)  tends  to  a  limit  l  as  n  tends  to  oo  if  <f>(n)  is 
nearly  equal  to  l  when  n  is  large.  But  although  the  meaning  of  this  state¬ 
ment  should  be  clear  enough  after  the  preceding  explanations,  it  is  not,  as 
it  stands,  precise  enough  to  serve  as  a  strict  mathematical  definition.  It  is, 
in  fact,  equivalent  to  a  whole  class  of  statements  of  the  type  ‘/or  sufficiently 
large  values  of  n,  <f>(n)  differs  from  l  by  less  than  e\  This  statement  has  to 
be  true  for  e  =  .01  or  .0001  or  any  positive  number;  and  for  any  such  value 
of  e  it  has  to  be  true  for  any  value  of  n  after  a  certain  definite  value  n0(e), 
though  the  smaller  e  is  the  larger,  as  a  rule,  will  be  this  value  no(e). 
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We  accordingly  frame  the  following  formal  definition: 

Definition  I.  The  function  <f>{ri)  is  said  to  tend  to  the  limit  l  as  n  tends 
to  oo,  if,  however  small  be  the  positive  number  e,  0(n)  differs  from  l  by 
less  than  e  for  sufficiently  large  values  of  n;  that  is  to  say  if,  however  small 
be  the  positive  number  e,  we  can  determine  a  number  7io(e)  corresponding 
to  e,  such  that  0(n)  differs  from  l  by  less  than  e  for  all  values  of  n  greater 
than  or  equal  to  n0(e). 

It  is  usual  to  denote  the  difference  between  <f>(n)  and  /,  taken  positively, 
by  1 <f>(n)  —l\.  It  is  equal  to  c j>(n)  —  l  or  to  l  —  (f(n),  whichever  is  positive,  and 
agrees  with  the  definition  of  the  modulus  of  <fr(ri)  —  /,  as  given  in  Chap.  Ill, 
though  at  present  we  are  only  considering  real  values,  positive  or  negative. 

With  this  notation  the  definition  may  be  stated  more  shortly  as  follows: 
Lif,  given  any  positive  number,  e,  however  small,  we  can  find  n0(e)  so  that 
1 <f>{n)  —  l |  <  e  when  n  ^  no(e),  then  we  say  that  <f(n)  tends  to  the  limit  l 
as  n  tends  to  oo,  and  write 


lim  <p(n)  =  /’. 

71— ^OO 


Sometimes  we  may  omit  the  ‘n  — >  oo’;  and  sometimes  it  is  convenient,  for 
brevity,  to  write  f>(n)  — >  l. 

The  reader  will  find  it  instructive  to  work  out,  in  a  few  simple  cases,  the 
explicit  expression  of  no  as  a  function  of  e.  Thus  if  4>(n)  =  1/n  then  l  =  0,  and 
the  condition  reduces  to  1/n  <  e  for  n  [k  no,  which  is  satisfied  if  no  =  1  +  [1/e]-* 
There  is  one  and  only  one  case  in  which  the  same  no  will  do  for  all  values  of  e. 
If,  from  a  certain  value  N  of  n  onwards,  4>(n)  is  constant,  say  equal  to  C,  then  it 
is  evident  that  </(n)  —  C  =  0  for  n  [k  N,  so  that  the  inequality  \f{n)  —  C  \  <  e  is 
satisfied  for  nf.N  and  all  positive  values  of  e.  And  if  \4>(n)  —  l\  <  e  for  nf.  N 
and  all  positive  values  of  e,  then  it  is  evident  that  =  l  when  n  ^  N,  so  that 
4>(n)  is  constant  for  all  such  values  of  n. 


*Here  and  henceforward  we  shall  use  [x]  in  the  sense  of  Chap.  II,  i.e.  as  the  greatest 
integer  not  greater  than  x. 
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59.  The  definition  of  a  limit  may  be  illustrated  geometrically  as  fol¬ 
lows.  The  graph  of  <f>(n)  consists  of  a  number  of  points  corresponding  to 
the  values  n  —  1,  2,  3,  ... . 

Draw  the  line  y  =  /,  and  the  parallel  lines  y  =  l  —  e,  y  =  l  +  e  at 
distance  e  from  it.  Then 

lim  <fi(n)  =  /, 

n— >oo 

if,  when  once  these  lines  have  been  drawn,  no  matter  how  close  they  may 


y  =  l  +  e 
y  =  l-e 


O  12  3  n0  X 

Fig.  27. 

be  together,  we  can  always  draw  a  line  x  =  n0,  as  in  the  figure,  in  such  a 
way  that  the  point  of  the  graph  on  this  line,  and  all  points  to  the  right  of 
it,  lie  between  them.  We  shall  find  this  geometrical  way  of  looking  at  our 
definition  particularly  useful  when  we  come  to  deal  with  functions  defined 
for  all  values  of  a  real  variable  and  not  merely  for  positive  integral  values. 

60.  So  much  for  functions  of  n  which  tend  to  a  limit  as  n  tends  to  oo. 
We  must  now  frame  corresponding  definitions  for  functions  which,  like  the 
functions  n 2  or  —  n2,  tend  to  positive  or  negative  infinity.  The  reader  should 
by  now  find  no  difficulty  in  appreciating  the  point  of 

Definition  II.  The  function  <f>(n)  is  said  to  tend  to  +oo  (positive 
infinity)  with  n,  if,  when  any  number  A,  however  large,  is  assigned,  we 
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can  determine  n0( A)  so  that  <f>(n)  >  A  when  n  A  n0(A);  that  is  to  say  if, 
however  large  A  may  be,  (f>(n)  >  A  for  sufficiently  large  values  of  n. 

Another,  less  precise,  form  of  statement  is  ‘if  we  can  make  <f>(n)  as  large 
as  we  please  by  sufficiently  increasing  n\  This  is  open  to  the  objection  that 
it  obscures  a  fundamental  point,  viz.  that  (f>(n)  must  be  greater  than  A  for 
all  values  of  n  such  that  n  A  n0( A),  and  not  merely  for  some  such  values. 
But  there  is  no  harm  in  using  this  form  of  expression  if  we  are  clear  what 
it  means. 

When  (f>(n)  tends  to  +oo  we  write 

0(n)  -A  Too. 

We  may  leave  it  to  the  reader  to  frame  the  corresponding  definition  for 
functions  which  tend  to  negative  infinity. 

61.  Some  points  concerning  the  definitions.  The  reader  should 
be  careful  to  observe  the  following  points. 

(1)  We  may  obviously  alter  the  values  of  (f>(n)  for  any  finite  number 
of  values  of  n,  in  any  way  we  please,  without  in  the  least  affecting  the 
behaviour  of  (f>(n)  as  n  tends  to  oo.  For  example  1/n  tends  to  0  as  n  tends 
to  oo.  We  may  deduce  any  number  of  new  functions  from  1/n  by  altering  a 
finite  number  of  its  values.  For  instance  we  may  consider  the  function  4>{n) 
which  is  equal  to  3  for  n  =  1,  2,  7,  11,  101,  107,  109,  237  and  equal 
to  1/n  for  all  other  values  of  n.  For  this  function,  just  as  for  the  original 
function  1/n,  lim</>(n)  =  0.  Similarly,  for  the  function  0(n)  which  is  equal 
to  3  if  n  =  1,  2,  7,  11,  101,  107,  109,  237,  and  to  n2  otherwise,  it  is  true 
that  0(n)  -A  Too. 

(2)  On  the  other  hand  we  cannot  as  a  rule  alter  an  infinite  number  of 
the  values  of  <f>(n)  without  affecting  fundamentally  its  behaviour  as  n  tends 
to  oo.  If  for  example  we  altered  the  function  1/n  by  changing  its  value 
to  1  whenever  n  is  a  multiple  of  100,  it  would  no  longer  be  true  that 
lim  (f){n)  =  0.  So  long  as  a  finite  number  of  values  only  were  affected  we 
could  always  choose  the  number  n0  of  the  definition  so  as  to  be  greater 
than  the  greatest  of  the  values  of  n  for  which  0(n)  was  altered.  In  the 
examples  above,  for  instance,  we  could  always  take  no  >  237,  and  indeed 
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we  should  be  compelled  to  do  so  as  soon  as  our  imaginary  opponent  of  §  56 
had  assigned  a  value  of  e  as  small  as  3  (in  the  first  example)  or  a  value 
of  A  as  great  as  3  (in  the  second).  But  now  however  large  no  may  be  there 
will  be  greater  values  of  n  for  which  </>(n)  has  been  altered. 

(3)  In  applying  the  test  of  Definition  I  it  is  of  course  absolutely  essential 
that  we  should  have  |0(n)  — /|  <  e  not  merely  when  n  =  no  but  when  n  A  no, 
i.e.  for  n0  and  for  all  larger  values  of  n.  It  is  obvious,  for  example,  that, 
if  0(n)  is  the  function  last  considered,  then  given  e  we  can  choose  n0  so 
that  |0(n)|  <  e  when  n  =  Hq:  we  have  only  to  choose  a  sufficiently  large 
value  of  n  which  is  not  a  multiple  of  100.  But,  when  no  is  thus  chosen,  it 
is  not  true  that  |0(n)  |  <  e  when  n  A  no-  all  the  multiples  of  100  which  are 
greater  than  no  are  exceptions  to  this  statement. 

(4)  If  0(n)  is  always  greater  than  /,  we  can  replace  \<f>{n)  —  l\  by  0(n)  — 
Thus  the  test  whether  1/n  tends  to  the  limit  0  as  n  tends  to  oo  is  simply 
whether  1/n  <  e  when  n  A  no-  If  however  0(n)  =  (— l)n/n,  then  l  is 
again  0,  but  0(n)  —  l  is  sometimes  positive  and  sometimes  negative.  In 
such  a  case  we  must  state  the  condition  in  the  form  |0(n)  —  l\  <  e,  for 
example,  in  this  particular  case,  in  the  form  | </>(n)  |  <  e. 

(5)  The  limit  l  may  itself  be  one  of  the  actual  values  of  </>(n).  Thus  if 
(j){n)  =  0  for  all  values  of  n,  it  is  obvious  that  lim</>(n)  =  0.  Again,  if  we 
had,  in  (2)  and  (3)  above,  altered  the  value  of  the  function,  when  n  is  a 
multiple  of  100,  to  0  instead  of  to  1,  we  should  have  obtained  a  function 
0(n)  which  is  equal  to  0  when  n  is  a  multiple  of  100  and  to  1/n  otherwise. 
The  limit  of  this  function  as  n  tends  to  oo  is  still  obviously  zero.  This  limit 
is  itself  the  value  of  the  function  for  an  infinite  number  of  values  of  n,  viz. 
all  multiples  of  100. 

On  the  other  hand  the  limit  itself  need  not  ( and  in  general  will  not )  be 
the  value  of  the  function  for  any  value  of  n.  This  is  sufficiently  obvious  in 
the  case  of  4>{n)  =  1/n.  The  limit  is  zero;  but  the  function  is  never  equal 
to  zero  for  any  value  of  n. 

The  reader  cannot  impress  these  facts  too  strongly  on  his  mind.  A 
limit  is  not  a  value  of  the  function:  it  is  something  quite  distinct  from 
these  values,  though  it  is  defined  by  its  relations  to  them  and  may  possibly 
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be  equal  to  some  of  them.  For  the  functions 

0(n)  =  0,  1, 

the  limit  is  equal  to  all  the  values  of  0(n):  for 

0(n)  =  1/n,  (-1  )n/n,  1  +  (1/n),  1  +  {(-l)n/n} 

it  is  not  equal  to  any  value  of  (j>(n):  for 

4>{n)  =  (sin  ^mr)/n,  1  +  {(sin  |mr)/n} 

(whose  limits  as  n  tends  to  oo  are  easily  seen  to  be  0  and  1,  since  sin  \nn  is 
never  numerically  greater  than  1)  the  limit  is  equal  to  the  value  which  c j>(n ) 
assumes  for  all  even  values  of  n,  but  the  values  assumed  for  odd  values  of  n 
are  all  different  from  the  limit  and  from  one  another. 

(6)  A  function  may  be  always  numerically  very  large  when  n  is  very 
large  without  tending  either  to  +oo  or  to  — oo.  A  sufficient  illustration  of 
this  is  given  by  0(n)  =  (— 1  )nn.  A  function  can  only  tend  to  +oo  or  to  — oo 
if,  after  a  certain  value  of  n,  it  maintains  a  constant  sign. 

Examples  XXIII.  Consider  the  behaviour  of  the  following  functions  of  n 
as  n  tends  to  oo: 

1.  <f>{n)  =  nk,  where  A:  is  a  positive  or  negative  integer  or  rational  fraction. 
If  k  is  positive,  then  nk  tends  to  +oo  with  n.  If  k  is  negative,  then  lirn  nk  =  0. 
If  k  =  0,  then  nk  =  1  for  all  values  of  n.  Hence  lirn  nk  =  1. 

The  reader  will  find  it  instructive,  even  in  so  simple  a  case  as  this,  to  write 
down  a  formal  proof  that  the  conditions  of  our  definitions  are  satisfied.  Take 
for  instance  the  case  of  k  >  0.  Let  A  be  any  assigned  number,  however  large. 
We  wish  to  choose  no  so  that  nk  >  A  when  n  ^  no-  We  have  in  fact  only  to 
take  for  no  any  number  greater  than  \/A.  If  e.g.  k  =  4,  then  n4  >  10,000  when 
n  ^  11,  n4  >  100,000,000  when  n  ^  101,  and  so  on. 

2.  </>(n)  =  pn ,  where  pn  is  the  nth  prime  number.  If  there  were  only  a  finite 
number  of  primes  then  <p(n)  would  be  defined  only  for  a  finite  number  of  values 
of  n.  There  are  however,  as  was  first  shown  by  Euclid,  infinitely  many  primes. 
Euclid’s  proof  is  as  follows.  If  there  are  only  a  finite  number  of  primes,  let  them 
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be  1,  2,  3,  5,  7,  11,  . . .  N.  Consider  the  number  1  +  (1  •  2  •  3  •  5  •  7- 11 . . .  N).  This 
number  is  evidently  not  divisible  by  any  of  2,  3,  5,  . . .  N,  since  the  remainder 
when  it  is  divided  by  any  of  these  numbers  is  1.  It  is  therefore  not  divisible 
by  any  prime  save  1,  and  is  therefore  itself  prime,  which  is  contrary  to  our 
hypothesis. 

It  is  moreover  obvious  that  4>(n)  >  n  for  all  values  of  n  (save  n  =  1,  2,  3). 
Hence  (f{n)  -»  +oo. 

3.  Let  4>{n)  be  the  number  of  primes  less  than  n.  Here  again  (f>(n)  +oo. 

4.  <j>(n)  =  [an],  where  a  is  any  positive  number.  Here 

<f>(n)  =  0  (0  ^  n  <  1/ck),  <f>(n )  =  1  (1/a  ^  n  <  2/a), 

and  so  on;  and  f(n)  — >  +oo. 

5.  If  4>(n)  =  1,000,000/n,  then  lim  <f{n)  =  0:  and  if  f>(n)  =  n/1,000,000, 

then  iffn)  — >  +oo.  These  conclusions  are  in  no  way  affected  by  the  fact  that  at 
first  4>(n)  is  much  larger  than  being  in  fact  larger  until  n  =  1,000,000. 

6.  4>{n)  =  l/{n— (— l)n},  n  —  (— l)n,  n{l  — (— l)n}.  The  first  function  tends 
to  0,  the  second  to  +oo,  the  third  does  not  tend  either  to  a  limit  or  to  +oo. 

7.  (f>(n)  =  (smn9ir)/n,  where  9  is  any  real  number.  Here  |  </>(?z)  |  <  1/n, 
since  |  sinn07r|  ^  1,  and  lim  (j){n)  =  0. 

8.  4>(n)  =  (sin  n0vr)/y/n,  (a  cos2  nO  +  b  sin2  n6)/n,  where  a  and  b  are  any 
real  numbers. 

9.  <j>(ri)  =  sin  nOn.  If  9  is  integral  then  4>{n)  =  0  for  all  values  of  n,  and 
therefore  lim  (p(n)  =  0. 

Next  let  9  be  rational,  e.g.  9  =  p/q,  where  p  and  q  are  positive  integers.  Let 
n  =  aq  +  b  where  a  is  the  quotient  and  b  the  remainder  when  n  is  divided  by  q. 
Then  sin(np7r/g)  =  (— l)ap  sm(bpir/q).  Suppose,  for  example,  p  even;  then,  as 
n  increases  from  0  to  q  —  1,  (j>(n)  takes  the  values 

0,  sm(pir/q),  sm(2pir/q),  ...  sin{(g  —  l)pir/q}. 

When  n  increases  from  q  to  2q  —  1  these  values  are  repeated;  and  so  also  as 
n  goes  from  2q  to  3q  —  1,  3q  to  4q  —  1,  and  so  on.  Thus  the  values  of  (j>(n)  form 
a  perpetual  cyclic  repetition  of  a  finite  series  of  different  values.  It  is  evident 
that  when  this  is  the  case  <j>(n)  cannot  tend  to  a  limit,  nor  to  +oo,  nor  to  —  oo, 
as  n  tends  to  infinity. 

The  case  in  which  9  is  irrational  is  a  little  more  difficult.  It  is  discussed  in 
the  next  set  of  examples. 
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62.  Oscillating  Functions.  Definition.  When  <f>(n)  does  not  tend 
to  a  limit,  nor  to  +oo,  nor  to  —oo,  as  n  tends  to  oo,  we  say  that  <f>(n) 
oscillates  as  n  tends  to  oo. 

A  function  <f>{n)  certainly  oscillates  if  its  values  form,  as  in  the  case 
considered  in  the  last  example  above,  a  continual  repetition  of  a  cycle  of 
values.  But  of  course  it  may  oscillate  without  possessing  this  peculiarity. 
Oscillation  is  defined  in  a  purely  negative  manner:  a  function  oscillates 
when  it  does  not  do  certain  other  things. 

The  simplest  example  of  an  oscillatory  function  is  given  by 

0(n)  =  (-l)n, 

which  is  equal  to  +1  when  n  is  even  and  to  —1  when  n  is  odd.  In  this  case 
the  values  recur  cyclically.  But  consider 

0(n)  =  (-l)n  +  (1/n), 

the  values  of  which  are 

-1  +  1,  1  +  (1/2),  -  1  +  (1/3),  1  +  (1/4),  -  1  +  (1/5),  .... 

When  n  is  large  every  value  is  nearly  equal  to  +1  or  —1,  and  obviously 
<f>(n)  does  not  tend  to  a  limit  or  to  +oo  or  to  — oo,  and  therefore  it  oscillates: 
but  the  values  do  not  recur.  It  is  to  be  observed  that  in  this  case  every 
value  of  4>{n)  is  numerically  less  than  or  equal  to  3/2.  Similarly 

<j>(n)  =  (-l)n100  +  (1000/n) 

oscillates.  When  n  is  large,  every  value  is  nearly  equal  to  100  or  to  —100. 
The  numerically  greatest  value  is  900  (for  n  =  1).  But  now  consider 
4>(n)  =  (— 1  )nn,  the  values  of  which  are  —1,  2,  —3,  4,  —5,  ....  This  function 
oscillates,  for  it  does  not  tend  to  a  limit,  nor  to  +oo,  nor  to  — oo.  And 
in  this  case  we  cannot  assign  any  limit  beyond  which  the  numerical  value 
of  the  terms  does  not  rise.  The  distinction  between  these  two  examples 
suggests  a  further  definition. 

Definition.  If  (f>(n)  oscillates  as  n  tends  to  oo,  then  (f>(n)  will  be  said 
to  oscillate  finitely  or  infinitely  according  as  it  is  or  is  not  possible  to 
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assign  a  number  K  such  that  all  the  values  of  <p(n)  are  numerically  less 
than  K,  i.e.  \<j>(n)\  <  K  for  all  values  of  n. 

These  definitions,  as  well  as  those  of  §§  58  and  60,  are  further  illustrated 
in  the  following  examples. 

Examples  XXIV.  Consider  the  behaviour  as  n  tends  to  oo  of  the  fol¬ 
lowing  functions: 

1.  (-l)n,  5  +  3(— l)n,  (1,000,000/n)  +  (-l)n,  l,000,000(-l)n  +  (1/n). 

2.  (-l)nn,  1,000,000  + (-l)nn. 

3.  1,000,000  -  n,  (-l)n(l,000,000  -  n). 

4.  n{l  +  (— l)n}.  In  this  case  the  values  of  f(n)  are 

0,  4,  0,  8,  0,  12,  0,  16,  .... 

The  odd  terms  are  all  zero  and  the  even  terms  tend  to  +oo:  <j>{n)  oscillates 
infinitely. 

5.  n 2  +  (— l)n2n.  The  second  term  oscillates  infinitely,  but  the  first  is  very 
much  larger  than  the  second  when  n  is  large.  In  fact  fin)  if  n2  —  2 n  and 
n2  —  2n  =  (n  —  l)2  —  1  is  greater  than  any  assigned  value  A  if  n  >  1  +  ^ A  +  1. 
Thus  f(n)  -+  +oo.  It  should  be  observed  that  in  this  case  f(2k  +  1)  is  always 
less  than  f{2 k),  so  that  the  function  progresses  to  infinity  by  a  continual  series 
of  steps  forwards  and  backwards.  It  does  not  however  ‘oscillate’  according  to 
our  definition  of  the  term. 

6.  n2{l  +  (— l)n},  (— l)nn2  +  n,  n3  +  (— l)n?z2. 

7.  sin  tiOtt.  We  have  already  seen  (Exs.  xxm.  9)  that  <j>{n)  oscillates  finitely 
when  6  is  rational,  unless  6  is  an  integer,  when  f(n)  =  0,  <j>{n)  — >  0. 

The  case  in  which  8  is  irrational  is  a  little  more  difficult.  But  it  is  not  difficult 
to  see  that  f(n)  still  oscillates  finitely.  We  can  without  loss  of  generality  suppose 
0  <  9  <  1.  In  the  first  place  |0(n)|  <  1.  Hence  f{n)  must  oscillate  finitely  or  tend 
to  a  limit.  We  shall  consider  whether  the  second  alternative  is  really  possible. 
Let  us  suppose  that 

lirn  sin  nOr  =  l. 

Then,  however  small  e  may  be,  we  can  choose  no  so  that  sin  n6n  lies  between 
l  —  e  and  l  +  e  for  all  values  of  n  greater  than  or  equal  to  no-  Hence 
sin(n  +  l)8ir  —  sin  nOn  is  numerically  less  than  2e  for  all  such  values  of  n,  and 
so  |  sin  2#vrcos(n  +  <  £■ 
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Hence 

cos (n  +  \)0ir  =  cosnOircos  \0ir  —  sinnf?7rsin  \0ir 
must  be  numerically  less  than  e/|  sin  \0ir\.  Similarly 

cos (n  —  \ )8ir  =  cosnOircos  \0ir  +  sinnf?7rsin  \0ir 

must  be  numerically  less  than  e/|  sin  ^07r|;  and  so  each  of  cosnOircos^Oir, 
sinnOir  sinnOir  must  be  numerically  less  than  e/|sin^07r|.  That  is  to  say, 
cosn07rcos  \0ir  is  very  small  if  n  is  large,  and  this  can  only  be  the  case  if 
cosnOir  is  very  small.  Similarly  sinnOir  must  be  very  small,  so  that  l  must  be 
zero.  But  it  is  impossible  that  cosnOir  and  sinn07r  can  both  be  very  small,  as 
the  sum  of  their  squares  is  unity.  Thus  the  hypothesis  that  sin  nOir  tends  to  a 
limit  l  is  impossible,  and  therefore  sinn07r  oscillates  as  n  tends  to  oo. 

The  reader  should  consider  with  particular  care  the  argument  ‘cos  nOir  cos  \0ir 
is  very  small,  and  this  can  only  be  the  case  if  cosnOir  is  very  small’.  Why,  he 
may  ask,  should  it  not  be  the  other  factor  cos^vr  which  is  ‘very  small’?  The 
answer  is  to  be  found,  of  course,  in  the  meaning  of  the  phrase  ‘very  small’  as 
used  in  this  connection.  When  we  say  ‘</>(n)  is  very  small’  for  large  values  of  n, 
we  mean  that  we  can  choose  no  so  that  4>(n)  is  numerically  smaller  than  any 
assigned  number,  if  n  ^  no-  Such  an  assertion  is  palpably  absurd  when  made 
of  a  fixed  number  such  as  cos  \0ir,  which  is  not  zero. 

Prove  similarly  that  cosnOir  oscillates  finitely,  unless  0  is  an  even  integer. 

8.  sinnOir  +  (1/n),  sinnOir  +  1,  sinnOir  +  n,  (— l)n  sinnOir. 

9.  a  cos  nOir  +  b  sin  nOir,  sin2  nOir,  a  cos2  nOir  +  b  sin2  nOir. 

10.  a  +  bn  +  (— 1  )n(c  +  dn)  +  e  cosnOir  +  f  sinnOir. 

11.  nsinnOir.  If  0  is  integral,  then  cf>(n)  =  0,  fi(n)  — >  0.  If  0  is  rational  but 
not  integral,  or  irrational,  then  4>{n)  oscillates  infinitely. 

12.  n(a  cos2  nOir  +  6  sin2  nOir).  In  this  case  4>{n)  tends  to  +oo  if  a  and  b  are 
both  positive,  but  to  — oo  if  both  are  negative.  Consider  the  special  cases  in 
which  a  =  0,  b  >  0,  or  a  >  0,  b  =  0,  or  a  =  0,  b  =  0.  If  a  and  b  have  opposite 
signs  cj)(n )  generally  oscillates  infinitely.  Consider  any  exceptional  cases. 

13.  sin(n2#7r).  If  0  is  integral,  then  fi{n)  — >  0.  Otherwise  4>(n)  oscillates 
finitely,  as  may  be  shown  by  arguments  similar  to  though  more  complex  than 
those  used  in  Exs.  xxiii.  9  and  XXIV.  7.* 

*See  Bromwich’s  Infinite  Series ,  p.  485. 
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14.  sin(n!  On).  If  0  has  a  rational  value  p/q,  then  n!  0  is  certainly  integral  for 
all  values  of  n  greater  than  or  equal  to  q.  Hence  4>(n)  — >  0.  The  case  in  which 
0  is  irrational  cannot  be  dealt  with  without  the  aid  of  considerations  of  a  much 
more  difficult  character. 

15.  cos(n!  On),  a  cos2(n!  On)  +  6sin2(n!  On),  where  0  is  rational. 

16.  an  —  [bn],  (— 1  )n(an—  [bn]). 

17.  [y/n\,  (~l)n[y/n\,  s/n-  [y/n). 

18.  The  smallest  prime  factor  of  n.  When  n  is  a  prime,  <$>(n)  =  n.  When 
n  is  even,  <j>(n)  =  2.  Thus  <j>(n)  oscillates  infinitely. 

19.  The  largest  prime  factor  of  n. 

20.  The  number  of  days  in  the  year  n  A.D. 

Examples  XXV.  1.  If  <f>(n)  — >  Too  and  if(n)  ^  4>(n)  for  all  values  of  n, 
then  if(n)  — >  +oo. 

2.  If  4>(n)  — >•  0,  and  \if(n)\  ^  \(t){n)\  for  all  values  of  n,  then  if(n)  — >  0. 

3.  If  lim  \4>(n)\  =  0,  then  lirn (f)(n)  =  0. 

4.  If  4>(n)  tends  to  a  limit  or  oscillates  finitely,  and  |^(n)|  ^  |(/>(n)|  when 
n  ^  no,  then  ifi(n)  tends  to  a  limit  or  oscillates  finitely. 

5.  If  f>(n)  tends  to  +oo,  or  to  — oo,  or  oscillates  infinitely,  and 

|V>(n)|  ^  \(f{n)\ 

when  n  ^  no,  then  i/)(n)  tends  to  +oo  or  to  — oo  or  oscillates  infinitely. 

6.  ‘If  4>(n)  oscillates  and,  however  great  be  no,  we  can  find  values  of  n 
greater  than  no  for  which  ip(n)  >  4>(n),  and  values  of  n  greater  than  no  for 
which  ip(n)  <  <j>(n),  then  ip(n)  oscillates’.  Is  this  true?  If  not  give  an  example 
to  the  contrary. 

7.  If  4>{n)  -A  l  as  n  — >  oo,  then  also  f>{n  +  p)  — >•  l,  p  being  any  fixed  integer. 
[This  follows  at  once  from  the  definition.  Similarly  we  see  that  if  <p{n)  tends 
to  Too  or  — oo  or  oscillates  so  also  does  <f(n  Tp).] 

8.  The  same  conclusions  hold  (except  in  the  case  of  oscillation)  if  p  varies 
with  n  but  is  always  numerically  less  than  a  fixed  positive  integer  N ;  or  if  p  varies 
with  n  in  any  way,  so  long  as  it  is  always  positive. 

9.  Determine  the  least  value  of  no  for  which  it  is  true  that 

(a)  n2  T  2 n  >  999,999  (n  T  no),  ( b )  n2  T  2 n  >  1,000,000  (n  T  no). 
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10.  Determine  the  least  value  of  no  for  which  it  is  true  that 

(a)  n  +  (-l)n  >  1000  (nAn0),  (b)  n  +  (-l)n  >  1,000,000  (n  A  n0). 

11.  Determine  the  least  value  of  no  for  which  it  is  true  that 

(a)  n2  +  2n  >  A  (n  A  no),  ( b )  n  +  (— l)n  >  A  (n  A  no), 

A  being  any  positive  number. 

[(a)  no  =  [\/A  +  1]:  (b)  no  =  1  +  [A]  or  2  +  [A],  according  as  [A]  is  odd  or 
even,  be.  n0  =  1  +  [A]  +  I{1  +  (-1)[A1}.] 

12.  Determine  the  least  value  of  no  such  that 

(a)  n/(n2  +  1)  <  .0001,  ( b )  (1/n)  +  {(-l)n/n2}  <  .00001, 

when  n  A  no-  [Let  us  take  the  latter  case.  In  the  first  place 

(1/n)  +  {(— l)n/n2}  A  (n  +  l)/n2, 

and  it  is  easy  to  see  that  the  least  value  of  71q,  such  that  (n  +  l)/n2  <  .000  001 
when  n  A  no,  is  1,000,002.  But  the  inequality  given  is  satisfied  by  n  =  1,000,001, 
and  this  is  the  value  of  no  required.] 

63.  Some  general  theorems  with  regard  to  limits.  A.  The 
behaviour  of  the  sum  of  two  functions  whose  behaviour  is  known. 

Theorem  I.  If  4>(n)  and  ip(n)  tend  to  limits  a,  b,  then  4>(n)  +  n ) 

tends  to  the  limit  a  +  b. 

This  is  almost  obvious.*  The  argument  which  the  reader  will  at  once 
form  in  his  mind  is  roughly  this:  ‘when  n  is  large,  </>(n)  is  nearly  equal  to  a 

*  There  is  a  certain  ambiguity  in  this  phrase  which  the  reader  will  do  well  to  notice. 
When  one  says  ‘such  and  such  a  theorem  is  almost  obvious’  one  may  mean  one  or  other 
of  two  things.  One  may  mean  ‘it  is  difficult  to  doubt  the  truth  of  the  theorem’,  ‘the 
theorem  is  such  as  common-sense  instinctively  accepts’,  as  it  accepts,  for  example,  the 
truth  of  the  propositions  ‘2  +  2  =  4’  or  ‘the  base-angles  of  an  isosceles  triangle  are 
equal’.  That  a  theorem  is  ‘obvious’  in  this  sense  does  not  prove  that  it  is  true,  since 
the  most  confident  of  the  intuitive  judgments  of  common  sense  are  often  found  to  be 
mistaken;  and  even  if  the  theorem  is  true,  the  fact  that  it  is  also  ‘obvious’  is  no  reason 
for  not  proving  it,  if  a  proof  can  be  found.  The  object  of  mathematics  is  to  prove  that 
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and  t/>(n)  to  b,  and  therefore  their  sum  is  nearly  equal  to  a  +  b\  It  is  well 
to  state  the  argument  quite  formally,  however. 

Let  e  be  any  assigned  positive  number  (e.g.  .001,  .000  0001,  . . .).  We 
require  to  show  that  a  number  no  can  be  found  such  that 

| <j){n)  +  ip(n)  -  a-b\  <  e,  (1) 

when  n  ^  n0.  Now  by  a  proposition  proved  in  Chap.  Ill  (more  generally 
indeed  than  we  need  here)  the  modulus  of  the  sum  of  two  numbers  is  less 
than  or  equal  to  the  sum  of  their  moduli.  Thus 

1 0(n)  +  tf{n)  —  a  —  b\  ^  1 0(n)  —  o|  +  | if)(n)  —  b\. 

It  follows  that  the  desired  condition  will  certainly  be  satisfied  if  no  can  be 
so  chosen  that 

1 0(n)  -  a|  +  | ip(n)  -  b\  <  e,  (2) 

when  n  ^  no-  But  this  is  certainly  the  case.  For  since  lim0(n)  =  a  we  can, 
by  the  definition  of  a  limit,  find  n\  so  that  \<f>(n )  —  a\  <  e'  when  n  ^  ni, 
and  this  however  small  e'  may  be.  Nothing  prevents  our  taking  e!  =  |e, 
so  that  \4>(n)  —  a\  <  |e  when  n  ^  ri\ .  Similarly  we  can  find  n2  so  that 
| if(n)  —  5|  <  when  n  ^  n2.  Now  take  no  to  be  the  greater  of  the  two 
numbers  ri\,  n2.  Then  | <j>{n)  —  a\  <  and  \if{n)  —  b\  <  \e  when  n  ^  no, 
and  therefore  (2)  is  satisfied  and  the  theorem  is  proved. 

The  argument  may  be  concisely  stated  thus:  since  lim^n)  =  a  and 
lirri  fjfri)  =  b,  we  can  choose  ni,  n2  so  that 

1 4>(n)  —  a|  <  \e  (n  ^  ni),  | if(n)  —  b\<\e  (n  ^  n2 ); 

certain  premises  imply  certain  conclusions;  and  the  fact  that  the  conclusions  may  be  as 
‘obvious’  as  the  premises  never  detracts  from  the  necessity,  and  often  not  even  from  the 
interest  of  the  proof. 

But  sometimes  (as  for  example  here)  we  mean  by  ‘this  is  almost  obvious’  something 
quite  different  from  this.  We  mean  ‘a  moment’s  reflection  should  not  only  convince  the 
reader  of  the  truth  of  what  is  stated,  but  should  also  suggest  to  him  the  general  lines  of 
a  rigorous  proof’.  And  often,  when  a  statement  is  ‘obvious’  in  this  sense,  one  may  well 
omit  the  proof,  not  because  the  proof  is  in  any  sense  unnecessary,  but  because  it  is  a 
waste  of  time  and  space  to  state  in  detail  what  the  reader  can  easily  supply  for  himself. 
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and  then,  if  n  is  not  less  than  either  n±  or  ri2 , 


1 4>(n)  +  n )  —  a  —  b\  ^  \f(n)  —  a\  +  \if(n)  —  b\  <  e; 


and  therefore 


lim{0(n)  +  fi(n)}  =  a  +  b. 


64.  Results  subsidiary  to  Theorem  I.  The  reader  should  have  no 
difficulty  in  verifying  the  following  subsidiary  results. 

1.  If  (f}(n)  tends  to  a  limit,  but  fi>{n)  tends  to  +oo  or  to  — oo  or  oscil¬ 
lates  finitely  or  infinitely,  then  <j>(n)  +  ip(n)  behaves  like  ip(n). 

2.  If  <j)(n)  — *  +oo,  and  ip(n)  — *  +cxd  or  oscillates  finitely,  then 
<f>(n)  +  fi>{n)  — >  +cxd. 

In  this  statement  we  may  obviously  change  +oo  into  —  oo  throughout. 

3.  If  (p(n)  — *  -Too  andfi>(n)  — *  — oo,  then  fifn)  may  tend  either 

to  a  limit  or  to  +oo  or  to  — oo  or  may  oscillate  either  finitely  or  infinitely. 

These  five  possibilities  are  illustrated  in  order  by  (i)  ( f>(n)  =  n ,  fi(n)  =  —n, 
(ii)  4>(n )  =  n2,  fi(n)  =  — n,  (iii)  cfifii)  =  n,  fi(n)  =  —  n2,  (iv)  4>(n)  =  n  +  (— l)n, 
ip(n)  =  —  n,  (v)  f(n)  =  n2  +  (— l)nn,  if(n)  =  —n2.  The  reader  should  construct 
additional  examples  of  each  case. 

4.  If  0(n)  — >  +oo  andfi>(n)  oscillates  infinitely,  then  4>(n)  may 

tend  to  +oo  or  oscillate  infinitely,  but  cannot  tend  to  a  limit,  or  to  — oo, 
or  oscillate  finitely. 

For  fi{n)  =  {4>(n)  +  fi(n)}  —  </>(n);  and,  if  <f(n)  +  fi(n)  behaved  in  any  of  the 
three  last  ways,  it  would  follow,  from  the  previous  results,  that  fi{n)  —*■  — oo, 
which  is  not  the  case.  As  examples  of  the  two  cases  which  are  possible,  consider 
(i)  4>(n)  =  n2,  fi{n)  =  (— l)nn,  (ii)  (j){n)  =  n,  fi(n)  =  (— 1  )nn2 .  Here  again  the 
signs  of  +oo  and  — oo  may  be  permuted  throughout. 

5.  If  <f>fn)  andfi>(n)  both  oscillate  finitely,  then  <f(n)  must  tend 

to  a  limit  or  oscillate  finitely. 

As  examples  take 


n+1 


(i)  <Kn)  =  (-iy\  V>(n)  =  (- 1) 


(ii)  (j>{n)  =  fi{n)  =  (-l)n. 
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6.  If  <f>(n)  oscillates  finitely,  and  if(n)  infinitely,  then  (j){n)  +  if(n) 
oscillates  infinitely. 

For  4>{n)  is  in  absolute  value  always  less  than  a  certain  constant,  say  K.  On 
the  other  hand  if(n),  since  it  oscillates  infinitely,  must  assume  values  numerically 
greater  than  any  assignable  number  ( e.g .  10K,  100/i  ,  . . . ).  Hence  f(n)  +  if(n) 
must  assume  values  numerically  greater  than  any  assignable  number  (e.g.  9 K, 
99 K,  ...).  Hence  4>(n)  +  fi(n)  must  either  tend  to  +oo  or  — oo  or  oscillate 
infinitely.  But  if  it  tended  to  +oo  then 

fi(n)  =  {4>(n)  +  fi(n)}  —  <f(n) 

would  also  tend  to  +oo,  in  virtue  of  the  preceding  results.  Thus  <f(n)  +  fi(n) 
cannot  tend  to  +oo,  nor,  for  similar  reasons,  to  — oo:  hence  it  oscillates  infinitely. 

7.  If  both  (j)(n )  and  if  (n)  oscillate  infinitely,  then  (f(n)+if(n)  may  tend 
to  a  limit,  or  to  +oo;  or  to  —oo,  or  oscillate  either  finitely  or  infinitely. 

Suppose,  for  instance,  that  <f(n )  =  (— l)nn,  while  ip(n)  is  in  turn  each  of  the 
functions  (— 1  )n+1n,  {1  +  (— l)n+1}n,  —  {1  +  (— l)n}n,  (— l)"+1(n  +  1),  (— 1  )nn. 
We  thus  obtain  examples  of  all  five  possibilities. 

The  results  1-7  cover  all  the  cases  which  are  really  distinct.  Before 
passing  on  to  consider  the  product  of  two  functions,  we  may  point  out  that 
the  result  of  Theorem  I  may  be  immediately  extended  to  the  sum  of  three 
or  more  functions  which  tend  to  limits  as  n  — >■  oo. 

65.  B.  The  behaviour  of  the  product  of  two  functions  whose 
behaviour  is  known.  We  can  now  prove  a  similar  set  of  theorems  con¬ 
cerning  the  product  of  two  functions.  The  principal  result  is  the  following. 

Theorem  II.  If  lim  0(n)  =  a  mid  lim^(n)  =  b,  then 

lim  (f(n)fi>(n)  =  ab. 


Let 


fi(n)  =  a  +  fii(n),  if(n)  =  b  +  ipi(n), 
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so  that  lim0i(n)  =  0  and  lim  'ip\ (n)  =  0.  Then 

0(n),0(n)  =  ab  +  aij)\(n)  +  b(j>i(n)  +  0i(n)^i(n). 

Hence  the  numerical  value  of  the  difference  —  ab  is  certainly  not 

greater  than  the  sum  of  the  numerical  values  of  a,0i(n),  50i(n),  4>\(n)ij}\(n). 
From  this  it  follows  that 


lim{0(n),0(n)  —  ab}  =  0, 

which  proves  the  theorem. 

The  following  is  a  strictly  formal  proof.  We  have 

\cj>(n)ijj(n)  -  ab\  |a^i(n)|  +  \b<f>i(n)\  +  |</>i(n)||^i(n)|. 

Assuming  that  neither  a  nor  b  is  zero,  we  may  choose  no  so  that 

|<Mn)l  <  |e/|6| ,  IV’i(n)!  <  |e/|a|, 

when  n  ^  no-  Then 

| cj)(n)i/j(n)  -  ab\  <  +  {|e2/(M|6|)}, 

which  is  certainly  less  than  e  if  e  <  ||a||6|.  That  is  to  say  we  can  choose  no  so 
that  |0(n)^(n)  —  ab\  <  e  when  n  ^  no,  and  so  the  theorem  follows.  The  reader 
should  supply  a  proof  for  the  case  in  which  at  least  one  of  a  and  b  is  zero. 

We  need  hardly  point  out  that  this  theorem,  like  Theorem  I,  may  be 
immediately  extended  to  the  product  of  any  number  of  functions  of  n. 
There  is  also  a  series  of  subsidiary  theorems  concerning  products  analogous 
to  those  stated  in  §  64  for  sums.  We  must  distinguish  now  six  different  ways 
in  which  <p(n)  may  behave  as  n  tends  to  oo.  It  may  (1)  tend  to  a  limit 
other  than  zero ,  (2)  tend  to  zero,  (3a)  tend  to  +oo,  (3 b)  tend  to  — oo, 
(4)  oscillate  finitely,  (5)  oscillate  infinitely.  It  is  not  necessary,  as  a  rule,  to 
take  account  separately  of  (3a)  and  (3  b),  as  the  results  for  one  case  may 
be  deduced  from  those  for  the  other  by  a  change  of  sign. 

To  state  these  subsidiary  theorems  at  length  would  occupy  more  space  than 
we  can  afford.  We  select  the  two  which  follow  as  examples,  leaving  the  verifica¬ 
tion  of  them  to  the  reader.  He  will  find  it  an  instructive  exercise  to  formulate 
some  of  the  remaining  theorems  himself. 
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(i)  If  4>{ri)  -A  Too  and  if  (71)  oscillates  finitely,  then  (p{n)fi{n)  must  tend 
to  Too  or  to  —00  or  oscillate  infinitely. 

Examples  of  these  three  possibilities  may  be  obtained  by  taking  <f{n)  to  be  n 
and  ip(n)  to  be  one  of  the  three  functions  2  T  (— l)n,  —2  —  (— l)n,  (— l)n. 

(ii)  If  4>(n)  and  if(n)  oscillate  finitely,  then  <f(n)fi{n)  must  tend  to  a  limit 
{which  may  be  zero)  or  oscillate  finitely. 

For  examples,  take  (a)  fi{n)  =  ip{n)  =  (— l)n,  (6)  <f>{n)  =  1  T  (— l)n, 
ip{n)  =  1  —  (— l)n,  and  (c)  (f{n)  =  cos  it,  fi{n)  =  sin  it. 

A  particular  case  of  Theorem  II  which  is  important  is  that  in  which 
fi>{n)  is  constant.  The  theorem  then  asserts  simply  that  lim  k(p{n)  =  ka  if 
lim0(n)  =  a.  To  this  we  may  join  the  subsidiary  theorem  that  if  (j){n)  -T 
Too  then  kfifn)  — *  Too  or  kfi{n)  — >  —00,  according  as  k  is  positive  or 
negative,  unless  k  =  0,  when  of  course  kfiin)  =  0  for  all  values  of  n  and 
lim  kfi{n)  =  0.  And  if  (f){n)  oscillates  finitely  or  infinitely,  then  so  does 
k(f>{n),  unless  k  =  0. 

66.  C.  The  behaviour  of  the  difference  or  quotient  of  two 
functions  whose  behaviour  is  known.  There  is,  of  course,  a  similar 
set  of  theorems  for  the  difference  of  two  given  functions,  which  are  obvious 
corollaries  from  what  precedes.  In  order  to  deal  with  the  quotient 

4>{n) 

we  begin  with  the  following  theorem. 

Theorem  III.  If  lim  4>{n)  =  a,  and  a  is  not  zero,  then 

y  1  1 

lim  ATT  =  “■ 

<p(n)  a 


Let 


(f>(n)  —  a  T  (pi  (n), 


so  that  lim01(n)  =  0.  Then 
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and  it  is  plain,  since  limc^n)  =  0,  that  we  can  choose  Uq  so  that  this  is 
smaller  than  any  assigned  number  e  when  n  V  no. 

From  Theorems  II  and  III  we  can  at  once  deduce  the  principal  theorem 
for  quotients,  viz. 

Theorem  IV.  //Iim0(n)  =  a  and  lim^(n)  =  b,  and  b  is  not  zero, 
then 

4>{n)  _  a 
im  ip(n)  b ' 

The  reader  will  again  find  it  instructive  to  formulate,  prove,  and  il¬ 
lustrate  by  examples  some  of  the  ‘subsidiary  theorems’  corresponding  to 
Theorems  III  and  IV. 

67.  Theorem  V.  If  R{<p(n),  ip  (n),  y(n), . . .  }  is  any  rational  function 
ofcppn),  ippn),  x(n)>  ■■■  }  he.  any  function  of  the  form 

P{cp(n),ip(n),x(n), . . .  }/Q{cp(n),ip(n),x(n), 

where  P  and  Q  denote  polynomials  in  <p(n),  ipfn),  y(n),  . . . .'  and  if 

lim  (Pin)  =  a,  lim^R)  =  6,  limy(n)  =  c,  . . . , 

and 

Q(a,  b,c, . . .)  ±  0; 

then 

lim  R{cp(n),ip(n),  y(n), . . .  }  =  R(a,  b,  c, . . . ). 

For  P  is  a  sum  of  a  finite  number  of  terms  of  the  type 

A{(p(n)}p{ip(n)}9 . . .  , 

where  A  is  a  constant  and  p,  q,  . . .  positive  integers.  This  term,  by  The¬ 
orem  II  (or  rather  by  its  obvious  extension  to  the  product  of  any  number 
of  functions)  tends  to  the  limit  Aapbq . . . ,  and  so  P  tends  to  the  limit 
P(a,  b,  c, . . . ),  by  the  similar  extension  of  Theorem  I.  Similarly  Q  tends  to 
Q(a,b,  c, . . . );  and  the  result  then  follows  from  Theorem  IV. 
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68.  The  preceding  general  theorem  may  be  applied  to  the  following 
very  important  particular  problem:  what  is  the  behaviour  of  the  most  gen¬ 
eral  rational  function  of  n,  viz. 


a0np  +  Gp  np  1  +  •  ■  ■  +  ap 
b0nq  +  bini-1  + - b  bq  ’ 


as  n  tends  to  oo?* 

In  order  to  apply  the  theorem  we  transform  S(n)  by  writing  it  in  the 


The  function  in  curly  brackets  is  of  the  form  i?{0(n)},  where  (fin)  =  1/n, 
and  therefore  tends,  as  n  tends  to  oo,  to  the  limit  -R(O)  =  a0/b0.  Now 
np~q  — >  0  if  p  <  q;  np~q  =  1  and  np~q  — >  1  if  p  —  q;  and  np~q  — >  +cx)  if 
p  >  q.  Hence,  by  Theorem  II, 


limS'(n)  =  0  (p  <  q), 
lim  S(n)=a0/b0  ( p  =  q ), 

S(n)  —>■  +oo  ( p  >  q,  a,o/b0  positive ), 
S(n)  — >  —oo  ( p  >  q,  ao/bo  negative ). 


Examples  XXVI. 


n  —  1 
n  +  1 


1. 

(-1)" 


What  is  the  behaviour  of  the  functions 

dVr)2,  (-D"=Vt, 

\n  + 1  /  n  n 


as  n  — >  oo? 

2.  Which  (if  any)  of  the  functions 

l/(cos2  },nir  +  nsin2  ^nir),  l/{n(cos2  \mr  +  n sin2  ln7r)}, 
(ncos2  ln7r  +  sin2  ln7r)/{n(cos2  +  nsin2  \nn)} 


tend  to  a  limit  as  n  — >  oo? 


We  naturally  suppose  that  neither  ao  nor  &o  is  zero. 
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3.  Denoting  by  S(n)  the  general  rational  function  of  n  considered  above, 
show  that  in  all  cases 


S(n  +  1) 
S(n 


=  1, 


lim 


S{n  +  (1/n)} 

W) 


=  1. 


69.  Functions  of  n  which  increase  steadily  with  n.  A  special 
but  particularly  important  class  of  functions  of  n  is  formed  by  those  whose 
variation  as  n  tends  to  oo  is  always  in  the  same  direction,  that  is  to  say 
those  which  always  increase  (or  always  decrease)  as  n  increases.  Since 
— 4>(n)  always  increases  if  (f>(n)  always  decreases,  it  is  not  necessary  to 
consider  the  two  kinds  of  functions  separately;  for  theorems  proved  for  one 
kind  can  at  once  be  extended  to  the  other. 

Definition.  The  function  (f>(n)  will  be  said  to  increase  steadily  with  n 
if  (j)(n  +  1)  A  (f>(n)  for  all  values  of  n. 

It  is  to  be  observed  that  we  do  not  exclude  the  case  in  which  (f>(n)  has 
the  same  value  for  several  values  of  n;  all  we  exclude  is  possible  decrease. 
Thus  the  function 

4>{n)  =  2  n  +  (-l)n, 

whose  values  for  n  =  0,  1,  2,  3,  4,  . . .  are 

1,  1,  5,  5,  9,  9,  ... 

is  said  to  increase  steadily  with  n.  Our  definition  would  indeed  include 
even  functions  which  remain  constant  from  some  value  of  n  onwards;  thus 
4>{n)  =  1  steadily  increases  according  to  our  definition.  However,  as  these 
functions  are  extremely  special  ones,  and  as  there  can  be  no  doubt  as  to 
their  behaviour  as  n  tends  to  oo,  this  apparent  incongruity  in  the  definition 
is  not  a  serious  defect. 

There  is  one  exceedingly  important  theorem  concerning  functions  of 
this  class. 

Theorem.  If  <f>(n)  steadily  increases  with  n,  then  either  (i)  <f>(n)  tends 
to  a  limit  as  n  tends  to  oo,  or  (ii)  (j){n)  — »  +oo. 

That  is  to  say,  while  there  are  in  general  five  alternatives  as  to  the 
behaviour  of  a  function,  there  are  two  only  for  this  special  kind  of  function. 
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This  theorem  is  a  simple  corollary  of  Dedekind’s  Theorem  (§  17).  We 
divide  the  real  numbers  £  into  two  classes  L  and  R ,  putting  £  in  L  or  R 
according  as  <f>{n)  A  £  for  some  value  of  n  (and  so  of  course  for  all  greater 
values),  or  <j>(n)  <  £  for  all  values  of  n. 

The  class  L  certainly  exists;  the  class  R  may  or  may  not.  If  it  does 
not,  then,  given  any  number  A,  however  large,  <p(n)  >  A  for  all  sufficiently 
large  values  of  n,  and  so 

£>(n)  — y  Too. 

If  on  the  other  hand  R  exists,  the  classes  L  and  R  form  a  section  of 
the  real  numbers  in  the  sense  of  §  17.  Let  a  be  the  number  corresponding 
to  the  section,  and  let  e  be  any  positive  number.  Then  c j>(n )  <  a  +  e  for 
all  values  of  n,  and  so,  since  e  is  arbitrary,  <f>(n)  A  a.  On  the  other  hand 
4>{n)  >  a  —  e  for  some  value  of  n ,  and  so  for  all  sufficiently  large  values. 
Thus 

a  —  e  <  £>(n)  A  a 

for  all  sufficiently  large  values  of  n;  i.e. 

4>{n)  — »  a. 

It  should  be  observed  that  in  general  <f>(n)  <  a  for  all  values  of  n;  for  if 
(j>(n)  is  equal  to  a  for  any  value  of  n  it  must  be  equal  to  a  for  all  greater  values 
of  n.  Thus  </>(n)  can  never  be  equal  to  a  except  in  the  case  in  which  the  values 
of  4>(n)  are  ultimately  all  the  same.  If  this  is  so,  a  is  the  largest  member  of  L; 
otherwise  L  has  no  largest  member. 

Cor  1.  If  (f)(n)  increases  steadily  with  n,  then  it  will  tend  to  a  limit  or 
to  Too  according  as  it  is  or  is  not  possible  to  find  a  number  K  such  that 
0(n)  <  K  for  all  values  of  n. 

We  shall  find  this  corollary  exceedingly  useful  later  on. 

Cor  2.  If  <f{n)  increases  steadily  with  n,  and  fi(n)  <  K  for  all  values 
of  n,  then  <f>{n)  tends  to  a  limit  and  this  limit  is  less  than  or  equal  to  K. 

It  should  be  noticed  that  the  limit  may  be  equal  to  K:  if  e.g. 
<p(n)  =  3  —  (1/n),  then  every  value  of  <f>(n)  is  less  than  3,  but  the  limit  is 
equal  to  3. 
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Cor  3.  If  4>(ri)  increases  steadily  with  n,  and  tends  to  a  limit,  then 

4>{n)  ^  lim0(n) 


for  all  values  of  n. 

The  reader  should  write  out  for  himself  the  corresponding  theorems  and 
corollaries  for  the  case  in  which  <f>(n)  decreases  as  n  increases. 

70.  The  great  importance  of  these  theorems  lies  in  the  fact  that  they 
give  us  (what  we  have  so  far  been  without)  a  means  of  deciding,  in  a  great 
many  cases,  whether  a  given  function  of  n  does  or  does  not  tend  to  a 
limit  as  n  — >  oo,  without  requiring  us  to  be  able  to  guess  or  otherwise  infer 
beforehand  what  the  limit  is.  If  we  know  what  the  limit,  if  there  is  one, 
must  be,  we  can  use  the  test 

1 4>(n)  —  l\  <  e  (n  ^  n0)  : 

as  for  example  in  the  case  of  <f>(n)  =  1/n,  where  it  is  obvious  that  the  limit 
can  only  be  zero.  But  suppose  we  have  to  determine  whether 

*">=(1  +  s)" 

tends  to  a  limit.  In  this  case  it  is  not  obvious  what  the  limit,  if  there  is 
one,  will  be:  and  it  is  evident  that  the  test  above,  which  involves  l,  cannot 
be  used,  at  any  rate  directly,  to  decide  whether  l  exists  or  not. 

Of  course  the  test  can  sometimes  be  used  indirectly,  to  prove  by  means  of 
a  reductio  ad  absurdum  that  l  cannot  exist.  If  e.g.  (j>{n)  =  (— l)n,  it  is  clear 
that  l  would  have  to  be  equal  to  1  and  also  equal  to  —1,  which  is  obviously 
impossible. 

71.  Alternative  proof  of  Weierstrass’s  Theorem  of  §  19.  The 

results  of  §  69  enable  us  to  give  an  alternative  proof  of  the  important  theorem 
proved  in  §  19. 

If  we  divide  PQ  into  two  equal  parts,  one  at  least  of  them  must  contain 
infinitely  many  points  of  S.  We  select  the  one  which  does,  or,  if  both  do,  we 
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P 


P3  Q3 

T 

P2  | 

Pa  Qa 


Q 1 

Q2 


Q 


Fig.  28. 


select  the  left-hand  half;  and  we  denote  the  selected  half  by  P\Q\  (Fig.  28).  If 
P\  Q\  is  the  left-hand  half,  P\  is  the  same  point  as  P. 

Similarly,  if  we  divide  P\Q\  into  two  halves,  one  at  least  of  them  must  contain 
infinitely  many  points  of  S.  We  select  the  half  P2Q2  which  does  so,  or,  if  both  do 
so,  we  select  the  left-hand  half.  Proceeding  in  this  way  we  can  define  a  sequence 
of  intervals 

PQ,  P1Q1,  P2Q2,  P3Q3,  ■ 

each  of  which  is  a  half  of  its  predecessor,  and  each  of  which  contains  infinitely 
many  points  of  S. 

The  points  P,  P\,  P2,  ■  ■  ■  progress  steadily  from  left  to  right,  and  so  Pn  tends 
to  a  limiting  position  T.  Similarly  Qn  tends  to  a  limiting  position  T' .  But 
TV  is  plainly  less  than  PnQn,  whatever  the  value  of  n;  and  PnQn ,  being  equal 
to  PQ/2n,  tends  to  zero.  Hence  T'  coincides  with  T,  and  Pn  and  Qn  both  tend 
to  T. 

Then  T  is  a  point  of  accumulation  of  S.  For  suppose  that  £  is  its  coordinate, 
and  consider  any  interval  of  the  type  [£  —  e,  £  +  e].  If  n  is  sufficiently  large,  PnQn 
will  lie  entirely  inside  this  interval.*  Hence  [£  —  e,  £  +  e]  contains  infinitely  many 
points  of  S. 

72.  The  limit  of  xn  as  n  tends  to  00.  Let  us  apply  the  results  of 
§  69  to  the  particularly  important  case  in  which  (f>(n)  =  xn.  If  x  =  1  then 
0(n)  =  1,  lim0(n)  =  1,  and  if  x  —  0  then  <p(n)  =  0,  lim  (j)(n)  =  0,  so  that 
these  special  cases  need  not  detain  us. 

First,  suppose  x  positive.  Then,  since  ( f>(n+  1)  =  xcp(n),  <f>(n)  increases 
with  n  if  x  >  1,  decreases  as  n  increases  if  x  <  1. 

*This  will  certainly  be  the  case  as  soon  as  PQ/2n  <  e. 
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If  x  >  1,  then  xn  must  tend  either  to  a  limit  (which  must  obviously 
be  greater  than  1)  or  to  +oo.  Suppose  it  tends  to  a  limit  l.  Then 
lim  (p(n  +  1)  =  lim  (b{n)  =  Z,  by  Exs.  xxv.  7;  but 

lim  tp{n  +  1)  =  lima;</>(n)  =  a:  lim  (j)(n)  =  xl, 

and  therefore  l  =  xl:  and  as  x  and  l  are  both  greater  than  1,  this  is 
impossible.  Hence 

xn  — >  +CX)  (x  >  1). 


Example.  The  reader  may  give  an  alternative  proof,  showing  by  the  bino¬ 
mial  theorem  that  x11  >  1  +  n5  if  5  is  positive  and  x  =  1  +  5,  and  so  that 

xn  — >  +oo. 


On  the  other  hand  xn  is  a  decreasing  function  if  x  <  1,  and  must  there¬ 
fore  tend  to  a  limit  or  to  —  oo.  Since  xn  is  positive  the  second  alternative 
may  be  ignored.  Thus  lima;”  =  l,  say,  and  as  above  l  —  xl,  so  that  l  must 
be  zero.  Hence 

lima;"  =  0  (0  <  x  <  1). 


Example.  Prove  as  in  the  preceding  example  that  ( l/x)n  tends  to  +oo  if 
0  <  x  <  1,  and  deduce  that  xn  tends  to  0. 

We  have  finally  to  consider  the  case  in  which  x  is  negative.  If 
—  1  <  x  <  0  and  x  =  —y,  so  that  0  <  y  <  1,  then  it  follows  from  what 
precedes  that  lim yn  =  0  and  therefore  lima:"  =  0.  If  x  —  —1  it  is  obvious 
that  xn  oscillates,  taking  the  values  —1,  1  alternatively.  Finally  if  x  <  —1, 
and  x  =  —y,  so  that  y  >  1,  then  yn  tends  to  +oo,  and  therefore  xn  takes 
values,  both  positive  and  negative,  numerically  greater  than  any  assigned 
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number.  Hence  xn  oscillates  infinitely.  To  sum  up: 


(f>(n)  =  xn  — >  Too 
lim  (f){ri)  =  1 
lim  4>{n)  =  0 
(f>(n)  oscillates  finitely 
<f>(n)  oscillates  infinitely 


0  >  1), 

{x  =  1), 

(-1  <  x  <  1), 
(x  =  -1), 

(x  <  —1). 


Examples  XXVII.*  1.  If  (f>(n)  is  positive  and  cj>{n Tl)  >  K<p(n),  where 
K  >  1,  for  all  values  of  n,  then  4>{n)  Too. 

[For 

4>(n)  >  K<j)(n  -  1)  >  K2(j)(n  -  2)  •  •  •  > 
from  which  the  conclusion  follows  at  once,  as  Kn  —>•  oo.] 

2.  The  same  result  is  true  if  the  conditions  above  stated  are  satisfied  only 
when  n  T  no- 

3.  If  4>(n)  is  positive  and  (j>(n  T  1)  <  K<f>(n),  where  0  <  K  <  1,  then 
lim  (j)(n)  =  0.  This  result  also  is  true  if  the  conditions  are  satisfied  only  when 
n  T  ji0. 

4.  If  \  fi{n  +  1)|  <  K\<f>{n)\  when  n  T  no,  and  0  <  K  <  1,  then  lim^(n)  =  0. 

5.  If  4>(n)  is  positive  and  lim{^(nT  l)}/{(/>(n)}  =  l  >  1,  then  <j>(n)  — >•  Too. 
[For  we  can  determine  71q  so  that  {<f)(n+  l)}/{^>(n)}  >  K  >  1  when  n  T  no: 

we  may,  e.g.,  take  K  halfway  between  1  and  l.  Now  apply  Ex.  1.] 

6.  If  lim{b(n  T  l)}/{0(n)}  =  l,  where  l  is  numerically  less  than  unity,  then 
lim0(n)  =  0.  [This  follows  from  Ex.  4  as  Ex.  5  follows  from  Ex.  1.] 

7.  Determine  the  behaviour,  as  n  — *  oo,  of  <j>(n)  =  nrxn,  where  r  is  any 
positive  integer. 

[If  x  =  0  then  (j>(n)  =  0  for  all  values  of  n,  and  </>(n)  — >  0.  In  all  other  cases 

4>{n  T  1)  (n+l\ 

4>{n)  ~  v^T” )  X^  X' 

First  suppose  x  positive.  Then  fi{n)  — >  Too  if  x  >  1  (Ex.  5)  and  (j>(n)  — >•  0  if 
x  <  1  (Ex.  6).  If  x  =  1,  then  (j>(n)  =  nr  — >  Too.  Next  suppose  x  negative.  Then 

*  These  examples  are  particularly  important  and  several  of  them  will  be  made  use 
of  later  in  the  text.  They  should  therefore  be  studied  very  carefully. 
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\4>(n)\  =  nr \x\n  tends  to  Too  if  |x|  A  1  and  to  0  if  \x\  <  1.  Hence  4>{n)  oscillates 
infinitely  if  x  A  —1  and  4>{n)  — >•  0  if  — 1  <  cc  <  0.] 

8.  Discuss  n~rxn  in  the  same  way.  [The  results  are  the  same,  except  that 
(p(n)  — >  0  when  x  =  1  or  —1.] 

9.  Draw  up  a  table  to  show  how  nkxn  behaves  asn->  oo,  for  all  real  values 
of  x,  and  all  positive  and  negative  integral  values  of  k. 

[The  reader  will  observe  that  the  value  of  k  is  immaterial  except  in  the 
special  cases  when  x  =  1  or  —1.  Since  lim{(?r  +  1  )/n}k  =  1,  whether  k  be 
positive  or  negative,  the  limit  of  the  ratio  (j)(n+  1  )/4>{ji)  depends  only  on  x,  and 
the  behaviour  of  4>(n)  is  in  general  dominated  by  the  factor  x11.  The  factor  nk 
only  asserts  itself  when  x  is  numerically  equal  to  1.] 

10.  Prove  that  if  x  is  positive  then  yfx  — >  1  as  n  — »  oo.  [Suppose,  e.g., 
x  >  1.  Then  x,  yfc,  y/x,  ...  is  a  decreasing  sequence,  and  yfx  >  1  for  all  values 
of  n.  Thus  yfx  — >•  l,  where  /  T  l.  But  if  l  >  1  we  can  find  values  of  n,  as  large 
as  we  please,  for  which  yfx  >  l  or  x  >  ln;  and,  since  ln  — >  +oo  as  n  — >  oo,  this 
is  impossible.] 

11.  fn  — >  1.  [For  n+y/n  +  1  <  yfn  if  (n  +  l)n  <  nn+l  or  {1  +  (l/n)}n  <  n, 
which  is  certainly  satisfied  if  n  ^  3  (see  §  73  for  a  proof).  Thus  yfn  decreases 
as  increases  from  3  onwards,  and,  as  it  is  always  greater  than  unity,  it  tends 
to  a  limit  which  is  greater  than  or  equal  to  unity.  But  if  yfn  — >  l,  where  l  >  1, 
then  n  >  ln,  which  is  certainly  untrue  for  sufficiently  large  values  of  n,  since 
ln/n  — >  Too  with  n  (Exs.  7,  8).] 

12.  \fn\  — >  Too.  [However  large  A  may  be,  n!  >  An  if  n  is  large  enough. 
For  if  un  =  An/n!  then  un+i/un  =  A/(n  T  1),  which  tends  to  zero  as  n  — >  oo, 
so  that  un  does  the  same  (Ex.  6).] 

13.  Show  that  if  —  1  <  x  <  1  then 

m(m  —  1) ...  (m  —  n  T  1 )  „  ( rn\  „ 

un  =  - j - X  =  )x 

n!  \n ) 

tends  to  zero  as  n  — >  oo. 

[If  m  is  a  positive  integer,  un  =  0  for  n  >  m.  Otherwise 

un+ 1  m  —  7i 
-  =  - —x  ->  —x, 


unless  x  =  0.] 
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73.  The  limit  of  (  1  H —  )  .  A  more  difficult  problem  which  can  be 


n 

solved  by  the  help  of  §  69  arises  when  <f>(n)  =  {1  +  l/n}n. 

It  follows  from  the  binomial  theorem*  that 

l\n  1  n(n  —  1)  1  n(n  —  1) . . .  (n  —  n  +  1)  1 

1  +  -  =l  +  n--+  \  — — ~  — ^  +  •  •  •  +  v  v  ’ 


n 


n 


1-2  n2 


1  f  1\  1 

=  1  +  1  + -  1  -  -  + - 

1  •  2  V  n  1-2-3 


1-2  ...n 

i--)(i-- 1  + 

n  j  \  n 


n" 


1  •  2  ...  n  \  n  )  V  n 


1  - 


n  —  1 


n 


The  ( p  +  l)th  term  in  this  expression,  viz. 


,-V  (i-2- 

1-2  . .  .p  \  n  )  \  n 


1  - 


p  —  1 


n 


is  positive  and  an  increasing  function  of  n,  and  the  number  of  terms  also 

(  \\n 

increases  with  n.  Hence  1  H —  increases  with  n ,  and  so  tends  to  a 

V  nJ 

limit  or  to  +oo,  as  n  — »  oo. 

But 

l\n  11  1 

1  +  nJ  <1  +  1  +  r2  +  r2_3+"'+1.2.3...n 

<1  +  1  +  V^  +  '"+2VT<3' 


Thus  (  1  H —  )  cannot  tend  to  +oo,  and  so 


lim  I  1  H —  =  e, 

n—>oo  V  n 


where  e  is  a  number  such  that  2  <  e  A  3. 


*The  binomial  theorem  for  a  positive  integral  exponent,  which  is  what  is  used  here, 
is  a  theorem  of  elementary  algebra.  The  other  cases  of  the  theorem  belong  to  the  theory 
of  infinite  series,  and  will  be  considered  later. 
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74.  Some  algebraical  lemmas.  It  will  be  convenient  to  prove  at  this 
stage  a  number  of  elementary  inequalities  which  will  be  useful  to  us  later  on. 

(i)  It  is  evident  that  if  a  >  1  and  r  is  a  positive  integer  then 

rar  >  ar-x  +  ar"2  +  •••  +  !. 


Multiplying  both  sides  of  this  inequality  by  a  —  1,  we  obtain 

rar  (a  —  1)  >  ar  —  1; 

and  adding  r{ar  —  1)  to  each  side,  and  dividing  by  r(r  +  1),  we  obtain 

ar+l  -  1  ar  -  1 


> 


r  +  1  r 

Similarly  we  can  prove  that 

1  -  f3r+l  1  -  /3r 


< 


(■ a  >  1). 


(0  <  p  <  1). 


r  +  1  r 

It  follows  that  if  r  and  s  are  positive  integers,  and  r  >  s,  then 

ar  -  1  as  -  1  l-/3r  1-PS 

-  >  - ,  —  <  — . 

r  s  r  s 

Here  0  <  /?  <  1  <  a.  In  particular,  when  s  =  1,  we  have 

ar  —  1  >  r(a  —  1),  1  —  (3r  <  r(l  —  /3). 


(1) 

(2) 

(3) 

(4) 


(ii)  The  inequalities  (3)  and  (4)  have  been  proved  on  the  supposition  that 
r  and  s  are  positive  integers.  But  it  is  easy  to  see  that  they  hold  under  the 
more  general  hypothesis  that  r  and  s  are  any  positive  rational  numbers.  Let  us 
consider,  for  example,  the  first  of  the  inequalities  (3).  Let  r  =  a/b,  s  =  c/d , 
where  a,  6,  c,  d  are  positive  integers;  so  that  ad  >  be.  If  we  put  a  =  rybd,  the 
inequality  takes  the  form 

('yad  —  1)/ ad  >  (rybc  —  l)/6c; 

and  this  we  have  proved  already.  The  same  argument  applies  to  the  remaining 
inequalities;  and  it  can  evidently  be  proved  in  a  similar  manner  that 


as  —  1  <  s(a  —  1),  1  —  (3s  >  s(l  —  /3), 


(5) 
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if  s  is  a  positive  rational  number  less  than  1. 

(iii)  In  what  follows  it  is  to  be  understood  that  all  the  letters  denote  positive 
numbers,  that  r  and  s  are  rational,  and  that  a  and  r  are  greater  than  1,  (3  and 


s  less  than  1.  Writing  l/f3  for  ct,  and  1/a  for  (3,  in  (4),  we  obtain 

ar  —  1  <  rar~1(a  —  1),  1  -  >  r^1  (1  -  (3).  (6) 

Similarly,  from  (5),  we  deduce 

as  —  1  >  saV^a  —  1),  1  —  (3s  <  s/3s-1(l  —  /3).  (7) 

Combining  (4)  and  (6),  we  see  that 

rar~1(a  —  1)  >  ar  —  1  >  r(a  —  1).  (8) 

Writing  x/y  for  a,  we  obtain 

rxr~1(x  —  y)  >  xr  —  yr  >  ryr~1(x  —  y)  (9) 

if  x  >  y  >  0.  And  the  same  argument,  applied  to  (5)  and  (7),  leads  to 

sxs^1(x  —  y)  <  xs  —  ys  <  sys^1(x  —  y).  (10) 


Examples  XXVIII.  1.  Verify  (9)  for  r  =  2,  3,  and  (10)  for  s  =  |,  1. 

2.  Show  that  (9)  and  (10)  are  also  true  if  y  >  x  >  0. 

3.  Show  that  (9)  also  holds  for  r  <  0.  [See  Chrystal’s  Algebra ,  vol.  ii, 
pp.  43-45.] 

4.  If  4>(n)  — >  l ,  where  l  >  0,  as  n  — >  oo,  then  cj)k  — >  lk ,  k  being  any  rational 
number. 

[We  may  suppose  that  k  >  0,  in  virtue  of  Theorem  III  of  §  66;  and  that 
<  0  <  2Z,  as  is  certainly  the  case  from  a  certain  value  of  n  onwards.  If  k  >  1, 

k^-1^  -l)>(j)k-  lk  >  klk-\(/  -  l ) 

klk~\l  -</>)>  lk  -  (pk  >  k(/k~l{l  -  </>), 


or 
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according  as  (f>  >  l  or  </>  <  l.  It  follows  that  the  ratio  of  \<fk  —  lk |  and  \4>  —  l\ 
lies  between  k{\l)k~l  and  k(2l)k _1.  The  proof  is  similar  when  0  <  k  <  1.  The 
result  is  still  true  when  l  =  0,  if  k  >  0.] 

5.  Extend  the  results  of  Exs.  XXVII.  7,  8,  9  to  the  case  in  which  r  or  k  are 
any  rational  numbers. 


75.  The  limit  of  n(  yfx  —  1).  If  in  the  first  inequality  (3)  of  §  74  we  put 
r  =  l/(n  —  1),  s  =  1/ro,  we  see  that 

(n-  1)(  "-■i/a-  1)  >n{$a-  1) 


when  a  >  1.  Thus  if  (j>(n)  =  n(y/a  —  1)  then  4>(n)  decreases  steadily  as  n  in¬ 
creases.  Also  <j)(n)  is  always  positive.  Hence  <f>(n)  tends  to  a  limit  l  as  n  — >  oo, 
and  l  ^  0. 

Again  if,  in  the  first  inequality  (7)  of  §  74,  we  put  s  =  1/n,  we  obtain 


n{  yfa  —  1)  >  yfa  (l - )  >1 - . 

a  a 


Thus  l  ^  1  —  (1/a)  >  0.  Hence,  if  a  >  1,  we  have 


lim  n(y/a—  1)  =  /(a), 

n— >oo 


where  f(a)  >  0. 

Next  suppose  (3  <  1,  and  let  j3  =  1/or,  then  n(>/]0  —  1)  =  —  n(y/a  —  1)/ \fa. 
Now  n(y/a  —  1)  — >  f{a),  and  (Exs.  xxvil.  10) 


1. 


Hence,  if  /3  =  1/a  <  1,  we  have 

n(  \fP  ~  1)  -A  —  /(a)- 

Finally,  if  x  =  1,  then  n(  yfx  —  1)  =  0  for  all  values  of  n. 
Thus  we  arrive  at  the  result:  the  limit 


lim  n{  yfx  —  1) 

defines  a  function  of  x  for  all  positive  values  of  x.  This  function  fix)  possesses 
the  properties 

fil/x)  =  -f(x),  /( 1)  =  0, 

and  is  positive  or  negative  according  as  x  >  1  or  x  <  1.  Later  on  we  shall  be 
able  to  identify  this  function  with  the  Napierian  logarithm  of  x. 
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Example.  Prove  that  f(xy )  =  f(x)  +  f{y).  [Use  the  equations 
f(xy)  =  limn( % fxy  —  1)  =  lim {n(\fx—  1)  ^/y  +  n( %/y  —  1)}.] 


76.  Infinite  Series.  Suppose  that  u(n)  is  any  function  of  n  defined 
for  all  values  of  n.  If  we  add  up  the  values  of  u(u)  for  v  —  1,  2,  . . .  n,  we 
obtain  another  function  of  n,  viz. 

s(n)  =  u(  1)  +  u( 2)  H - h  u(n), 

also  defined  for  all  values  of  n.  It  is  generally  most  convenient  to  alter  our 
notation  slightly  and  write  this  equation  in  the  form 

sn  =  ui  +  u2  H - b  un, 


or,  more  shortly, 


n 


v=l 


If  now  we  suppose  that  sn  tends  to  a  limit  s  when  n  tends  to  oo,  we 
have 

n 


lim  )  uv 

n— >06  • 

v=l 


=  S. 


This  equation  is  usually  written  in  one  of  the  forms 


OO 

^2uu  =  s,  Ui+U2  +  U3-\ - =  s, 

v=l 


the  dots  denoting  the  indefinite  continuance  of  the  series  of  u1  s. 

The  meaning  of  the  above  equations,  expressed  roughly,  is  that  by 
adding  more  and  more  of  the  u' s  together  we  get  nearer  and  nearer  to  the 
limit  s.  More  precisely,  if  any  small  positive  number  e  is  chosen,  we  can 
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choose  n0(e)  so  that  the  sum  of  the  first  n0(e)  terms,  or  any  of  greater 
number  of  terms,  lies  between  s  —  e  and  s  +  e;  or  in  symbols 

s-e<sn<s  +  e, 

if  n  A  no(e).  In  these  circumstances  we  shall  call  the  series 

U\  +  u2  +  ■  ■  ■ 

a  convergent  infinite  series,  and  we  shall  call  s  the  sum  of  the  series, 
or  the  sum  of  all  the  terms  of  the  series. 

Thus  to  say  that  the  series  U\  +  u2  +  . . .  converges  and  has  the  sum  s, 
or  converges  to  the  sum  s  or  simply  converges  to  s,  is  merely  another  way 
of  stating  that  the  sum  sn  =  u\  +  u2  +  •  •  •  +  un  of  the  first  n  terms  tends 
to  the  limit  s  as  n  — >  oo,  and  the  consideration  of  such  infinite  series 
introduces  no  new  ideas  beyond  those  with  which  the  early  part  of  this 
chapter  should  already  have  made  the  reader  familiar.  In  fact  the  sum  sn 
is  merely  a  function  0(n),  such  as  we  have  been  considering,  expressed  in 
a  particular  form.  Any  function  c f>{n )  may  be  expressed  in  this  form,  by 
writing 


4>{n)  =  0(1)  +  {0(2)  -  0(1)}  +  •  •  •  +  (0(n)  -  0(n  -  1)}; 

and  it  is  sometimes  convenient  to  say  that  0(n)  converges  (instead  of 
‘tends’)  to  the  limit  /,  say,  as  n  — >  oo. 

If  sn  -A  +oo  or  sn  -A  — oo,  we  shall  say  that  the  series  U\  +  u2  +  . . .  is 
divergent  or  diverges  to  +oo,  or  — oo,  as  the  case  may  be.  These  phrases 
too  may  be  applied  to  any  function  0(n):  thus  if  0(n)  -A  +oo  we  may  say 
that  0(n)  diverges  to  +oo.  If  sn  does  not  tend  to  a  limit  or  to  +oo  or 
to  —  oo,  then  it  oscillates  finitely  or  infinitely:  in  this  case  we  say  that  the 
series  U\  +  u2  +  . . .  oscillates  finitely  or  infinitely.* 

*The  reader  should  be  warned  that  the  words  ‘divergent’  and  ‘oscillatory’  are  used 
differently  by  different  writers.  The  use  of  the  words  here  agrees  with  that  of  Bromwich’s 
Infinite  Series.  In  Hobson’s  Theory  of  Functions  of  a  Real  Variable  a  series  is  said  to 
oscillate  only  if  it  oscillates  finitely ,  series  which  oscillate  infinitely  being  classed  as 
‘divergent’.  Many  foreign  writers  use  ‘divergent’  as  meaning  merely  ‘not  convergent’. 
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77.  General  theorems  concerning  infinite  series.  When  we  are 
dealing  with  infinite  series  we  shall  constantly  have  occasion  to  use  the 
following  general  theorems. 

(1)  If  U1+U2  +  .  ■  ■  is  convergent,  and  has  the  sum  s,  then  a+Mi+w2  +  .  ■  ■ 

is  convergent  and  has  the  sum  a+s.  Similarly  a+b+c-\ - \-k+Ui+u2  +  . . . 

is  convergent  and  has  the  sum  a  +  b  +  c+  --  -  +  k~\-s. 

(2)  If  ui  +  U2  +  •  •  •  is  convergent  and  has  the  sum  s,  then 
um+ 1  +  um+2  +  •  •  •  is  convergent  and  has  the  sum 

s  —  Mi  —  u2  —  ■  •  •  —  um- 

(3)  If  any  series  considered  in  (1)  or  (2)  diverges  or  oscillates,  then  so 
do  the  others. 

(4)  If  U1+U2  +  ■  ■  ■  is  convergent  and  has  the  sum  s,  then  ku\  +  ku2  + . . . 
is  convergent  and  has  the  sum  ks. 

(5)  If  the  first  series  considered  in  (4)  diverges  or  oscillates,  then  so 
does  the  second,  unless  k  —  0. 

(6)  If  +  «2  +  •  •  •  and  V\  +  u2  +  . . .  are  both  convergent,  then  the 
series  (u±  +  V\)  +  {u2  +  v2)  +  . . .  is  convergent  and  its  sum  is  the  sum  of 
the  first  two  series. 

All  these  theorems  are  almost  obvious  and  may  be  proved  at  once 
from  the  definitions  or  by  applying  the  results  of  §§  63-66  to  the  sum 
sn  =  u\  +  U2  +  ■  ■  ■  +  un.  Those  which  follow  are  of  a  somewhat  different 
character. 

(7)  If  U\  T  w2  T  •  •  •  is  convergent,  then  lim  un  =  0. 

For  un  =  sn  —  sn_i,  and  sn  and  sn_i  have  the  same  limit  s.  Hence 
lim  un  —  s  —  s  —  0. 

The  reader  may  be  tempted  to  think  that  the  converse  of  the  theorem  is  true 
and  that  if  limMn  =  0  then  the  series  ^ ^un  must  be  convergent.  That  this  is 
not  the  case  is  easily  seen  from  an  example.  Let  the  series  be 

1  +  \  +  5  +  I  +  •  • ' 

so  that  un  =  1/n.  The  sum  of  the  first  four  terms  is 

1  +  2  +  5  +  I>1  +  2  +  i  =  1  +  5  +  2- 
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The  sum  of  the  next  four  terms  is|  +  g  +  ^  +  |>g  =  |;  the  sum  of  the  next 
eight  terms  is  greater  than  =  5,  and  so  on.  The  sum  of  the  first 

4  +  4  +  8  +  16  +  •  •  •  +  2n  =  2n+1 


terms  is  greater  than 


2  +  5  +  5  +  ^5  —  i(n  +  3), 


and  this  increases  beyond  all  limit  with  n:  hence  the  series  diverges  to  +00. 

(8)  If  Ui  +  u2  +  u3  +  . . .  is  convergent,  then  so  is  any  series  formed  by 
grouping  the  terms  in  brackets  in  any  way  to  form  new  single  terms,  and 
the  sums  of  the  two  series  are  the  same. 

The  reader  will  be  able  to  supply  the  proof  of  this  theorem.  Here  again  the 
converse  is  not  true.  Thus  1  —  1  +  1  —  1  +  ...  oscillates,  while 

(1  -  1)  +  (1  -!)  +  ■■■ 


or  0  +  0  +  0  +  . . .  converges  to  0. 

(9)  If  every  term  un  is  positive  (or  zero),  then  the  series  Yhun  must 
either  converge  or  diverge  to  +00.  If  it  converges,  its  sum  must  be  positive 
(unless  all  the  terms  are  zero,  when  of  course  its  sum  is  zero). 

For  sn  is  an  increasing  function  of  n,  according  to  the  definition  of  §  69, 
and  we  can  apply  the  results  of  that  section  to  sn. 

(10 )  If  every  term  un  is  positive  (or  zero),  then  the  Jiecessary  and  suffi¬ 
cient  condition  that  the  series  ^  un  should  be  convergent  is  that  it  should 
be  possible  to  find  a  number  K  such  that  the  sum  of  any  number  of  terms 
is  less  than  K ;  and,  if  K  can  be  so  found,  then  the  sum  of  the  series  is  not 
greater  than  K . 

This  also  follows  at  once  from  §  69.  It  is  perhaps  hardly  necessary  to 
point  out  that  the  theorem  is  not  true  if  the  condition  that  every  un  is 
positive  is  not  fulfilled.  For  example 


1-1  +  1-1  +  ... 


obviously  oscillates,  sn  being  alternately  equal  to  1  and  to  0. 
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(11)  If  U\  +  u2  +  ... ,  Vi  +  v2  +  ■  ■  ■  are  two  series  of  positive  (or  zero) 
terms,  and  the  second  series  is  convergent,  and  if  un  V  Kvn,  where  K  is 
a  constant,  for  all  values  of  n,  then  the  first  series  is  also  convergent,  and 
its  sum  is  less  than  or  equal  to  K  times  that  of  the  second. 

For  if  v\  +  v2  +  •  •  •  =  t  then  v±  +  v2  +  ■  ■  ■  +  vn  ^  t  for  all  values  of  n, 

and  so  Mi  +  «2  I - +  =  Kt]  which  proves  the  theorem. 

Conversely,  ifJ2un  divergent,  andvn  ^  Kun,  then  ^2vn  is  divergent. 

78.  The  infinite  geometrical  series.  We  shall  now  consider  the 
‘geometrical’  series,  whose  general  term  is  un  =  rn_1.  In  this  case 

sn  =  1  +  r  +  r2  +  •  •  •  +  rni  =  (1  —  rn)/(  1  —  r), 

except  in  the  special  case  in  which  r  =  1,  when 

sn  =  l  +  l  +  --  -  +  l  =  n. 

In  the  last  case  sn  — s-  +oo.  In  the  general  case  sn  will  tend  to  a  limit  if  and 
only  if  rn  does  so.  Referring  to  the  results  of  §  72  we  see  that  the  series 

1  +  r  +  r2  +  . . .  is  convergent  and  has  the  sum  1/(1  —  r)  if  and  only  if 
-1  <  r  <  1. 

If  r  ^  1,  then  sn  ^  n,  and  so  sn  — >  +oo;  i.e.  the  series  diverges  to  +oo. 

If  r  =  —1,  then  sn  =  1  or  sn  =  0  according  as  n  is  odd  or  even:  i.e. 

sn  oscillates  finitely.  If  r  <  —1,  then  sn  oscillates  infinitely.  Thus,  to 
sum  up,  the  series  1  +  r  +  r2  +  . . .  diverges  to  +oo  if  r  ^  1,  converges 
to  1/(1  —  r)  if  —1  <  r  <  1,  oscillates  finitely  if  r  =  —1,  and  oscillates 
infinitely  if  r  <  —  1. 

Examples  XXIX.  1.  Recurring  decimals.  The  commonest  exam¬ 
ple  of  an  infinite  geometric  series  is  given  by  an  ordinary  recurring  decimal. 
Consider,  for  example,  the  decimal  .21713.  This  stands,  according  to  the  ordi¬ 
nary  rules  of  arithmetic,  for 

2  1  7  1  3  1  3  217  13  /  /  1  \  _  2687 

To+I^+To3+Io^+To5+To6+To7+'"  “  iooo+Tov  ^  -  ToV  “  12,375' 
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The  reader  should  consider  where  and  how  any  of  the  general  theorems  of  §  77 
have  been  used  in  this  reduction. 

2.  Show  that  in  general 


.d\CL2  •  •  •  drnOL\Oi2  •  •  • 


d\d2  •  •  •  djYiOix  . . .  OLfi  d\d2  •  •  •  dn 

99  ...  900  ...  0 


the  denominator  containing  n  9’s  and  m  0’s. 

3.  Show  that  a  pure  recurring  decimal  is  always  equal  to  a  proper  fraction 
whose  denominator  does  not  contain  2  or  5  as  a  factor. 

4.  A  decimal  with  m  non-recurring  and  n  recurring  decimal  figures  is  equal 
to  a  proper  fraction  whose  denominator  is  divisible  by  2m  or  5m  but  by  no  higher 
power  of  either. 

5.  The  converses  of  Exs.  3,  4  are  also  true.  Let  r  =  p/q,  and  suppose 
first  that  q  is  prime  to  10.  If  we  divide  all  powers  of  10  by  q  we  can  obtain 
at  most  q  different  remainders.  It  is  therefore  possible  to  find  two  numbers 
n\  and  112,  where  n\  >  ri2,  such  that  10ni  and  10n2  give  the  same  remainder. 
Hence  10ni  —  10n2  =  lO"-2  (10ni^n2  —  1)  is  divisible  by  q.  and  so  10n  —  1,  where 
n  =  n\  —112,  is  divisible  by  q.  Hence  r  may  be  expressed  in  the  form  P/(10n  —  1), 
or  in  the  form 

P  P 

i.e.  as  a  pure  recurring  decimal  with  n  figures.  If  on  the  other  hand  q  =  2"5^Q, 
where  Q  is  prime  to  10,  and  m  is  the  greater  of  a  and  (3,  then  10mr  has  a 
denominator  prime  to  10,  and  is  therefore  expressible  as  the  sum  of  an  integer 
and  a  pure  recurring  decimal.  But  this  is  not  true  of  10^r,  for  any  value  of  fj, 
less  than  m;  hence  the  decimal  for  r  has  exactly  m  non-recurring  figures. 

6.  To  the  results  of  Exs.  2-5  we  must  add  that  of  Ex.  1.  3.  Finally,  if  we 
observe  that 

-  9  9  9 

■9  “  TO  +  HE  +  I(H  +  "  '  “ 

we  see  that  every  terminating  decimal  can  also  be  expressed  as  a  mixed  recurring 
decimal  whose  recurring  part  is  composed  entirely  of  9’s.  For  example,  .217  = 
.2169.  Thus  every  proper  fraction  can  be  expressed  as  a  recurring  decimal,  and 
conversely. 

7.  Decimals  in  general.  The  expression  of  irrational  numbers  as 
non-recurring  decimals.  Any  decimal,  whether  recurring  or  not,  corresponds 
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to  a  definite  number  between  0  and  1.  For  the  decimal  .01020304  . . .  stands  for 
the  series 

Ol  _02_  «3 

10  +  102  +  103  +  ■  "  ' 

Since  all  the  digits  ar  are  positive,  the  sum  sn  of  the  first  n  terms  of  this  series 
increases  with  n,  and  it  is  certainly  not  greater  than  .9  or  1.  Hence  sn  tends  to 
a  limit  between  0  and  1. 

Moreover  no  two  decimals  can  correspond  to  the  same  number  (except  in 
the  special  case  noticed  in  Ex.  6).  For  suppose  that  .040203  . . . ,  .616263  . . .  are 
two  decimals  which  agree  as  far  as  the  figures  ar_i,  6r_i,  while  ar  >  br.  Then 
ar  ^  br  +  1  >  6r.6r+i6r+2  •  •  •  (unless  6r+i,  br+2,  ■  ■  .  are  all  9’s),  and  so 

.0102  •  •  •  O7.O7.-j-1  *  *  *  *6162  . . .  67.67._j_1 .... 

It  follows  that  the  expression  of  a  rational  fraction  as  a  recurring  decimal  (Exs. 
2-6)  is  unique.  It  also  follows  that  every  decimal  which  does  not  recur  represents 
some  irrational  number  between  0  and  1.  Conversely,  any  such  number  can  be 
expressed  as  such  a  decimal.  For  it  must  lie  in  one  of  the  intervals 

0,  1/10;  1/10,  2/10;  ...;  9/10,  1. 

If  it  lies  between  r/10  and  (r  +  1)/ 10,  then  the  first  figure  is  r.  By  subdividing 
this  interval  into  10  parts  we  can  determine  the  second  figure;  and  so  on.  But 
(Exs.  3,  4)  the  decimal  cannot  recur.  Thus,  for  example,  the  decimal  1.414 . . . , 
obtained  by  the  ordinary  process  for  the  extraction  of  \/2,  cannot  recur. 

8.  The  decimals  .101  001  000  100  001  0  . . .  and  .202  002  000  200  002  0  . . . ,  in 
which  the  number  of  zeros  between  two  l’s  or  2’s  increases  by  one  at  each  stage, 
represent  irrational  numbers. 

9.  The  decimal  .111  010  100  010  10  ... ,  in  which  the  nth  figure  is  1  if  n  is 
prime,  and  zero  otherwise,  represents  an  irrational  number.  [Since  the  number 
of  primes  is  infinite  the  decimal  does  not  terminate.  Nor  can  it  recur:  for  if  it 
did  we  could  determine  m  and  p  so  that  m,  m  +  p,  rn  +  2 p,  m  +  3 p,  . . .  are  all 
prime  numbers;  and  this  is  absurd,  since  the  series  includes  m  +  mp.]* 


*  All  the  results  of  Exs.  XXIX  may  be  extended,  with  suitable  modifications,  to 
decimals  in  any  scale  of  notation.  For  a  fuller  discussion  see  Bromwich,  Infinite  Series, 
Appendix  I. 
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Examples  XXX.  1.  The  series  rm  +  rm+1  +  . . .  is  convergent  if 
—  1  <  r  <  1,  and  its  sum  is  1/(1  —  r)  —  1  —  r  —  •  •  •  —  rm~l  (§  77,  (2)). 

2.  The  series  rm  +  rm+1  +  . . .  is  convergent  if  —1  <  r  <  1,  and  its  sum  is 
rm/(  1  —  r)  (§  77,  (4)).  Verify  that  the  results  of  Exs.  1  and  2  are  in  agreement. 

3.  Prove  that  the  series  1  +  2r  +  2r2  +  . . .  is  convergent,  and  that  its 
sum  is  (1  +  r)/(l  —  r),  ( a )  by  writing  it  in  the  form  —1  +  2(1  +  r  +  r2  +  . . . ), 
(/l)  by  writing  it  in  the  form  1  +  2 (r  +  r2  +  . . . ),  (7)  by  adding  the  two  series 
1  +  r  +  r2  +  . . . ,  r  +  r2  +  . . . .  In  each  case  mention  which  of  the  theorems  of  §  77 
are  used  in  your  proof. 

4.  Prove  that  the  ‘arithmetic’  series 

a  +  (o  +  b)  +  (a  +  2b)  +  . . . 

is  always  divergent,  unless  both  a  and  b  are  zero.  Show  that,  if  b  is  not  zero, 
the  series  diverges  to  +00  or  to  —00  according  to  the  sign  of  b,  while  if  b  =  0  it 
diverges  to  +00  or  —00  according  to  the  sign  of  a. 

5.  What  is  the  sum  of  the  series 

(1  -  r)  +  (r  -  r2)  +  (r2  -  r3)  +  . . . 

when  the  series  is  convergent?  [The  series  converges  only  if  —  1  <  r  ^  1.  Its  sum 
is  1,  except  when  r  =  1,  when  its  sum  is  0.] 

6.  Sum  the  series 

r2  r2 

v  H - 1 - h  .  .  .  . 

1  +  r2  (1  +  r2)2 

[The  series  is  always  convergent.  Its  sum  is  1  +  r2,  except  when  r  =  0,  when  its 
sum  is  0.] 

7.  If  we  assume  that  1  +  r  +  r2  +  . . .  is  convergent  then  we  can  prove  that 
its  sum  is  1/(1  —  r)  by  means  of  §  77,  (1)  and  (4).  For  ifl  +  r  +  r2  +  --  -  =  s  then 

s  =  1  +  r(l  +  r2  +  ...)  =  1  +  rs. 


8.  Sum  the  series 


r  + 


1  +  r 


+ 


r 

(1  +  r)2 


+  ... 
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when  it  is  convergent.  [The  series  is  convergent  if  —1  <  1/(1  +  r)  <  1,  i.e.  if 
r  <  —2  or  if  r  >  0,  and  its  sum  is  1  +  r.  It  is  also  convergent  when  r  =  0,  when 
its  sum  is  0.] 

9.  Answer  the  same  question  for  the  series 


r  r 

1  +  r  +  (1  +  r )2 
r  /  r  \ 
lTr^lTry 


r  T  t - 1"  — - ut  +  •  •  •  j 


1  ~r  (1  -r) 


1  +  - H  i  , 

1  —  r  \  1  —  r 


+  .... 


10.  Consider  the  convergence  of  the  series 

(1  +  r)  +  (r2  +  r3)  +  . . . ,  (1  +  r  T  r2)  +  (r3  +  r4  +  r5)  +  . . . , 

1  -  2r  +  r2  +  r3  -  2r4  T  r5  T  . . . ,  (1  -  2r  +  r2)  +  (r3  -  2r4  +  r5)  +  . . . , 

and  find  their  sums  when  they  are  convergent. 

11.  If  0  ^  an  ^  1  then  the  series  a o  +  ai  r  +  a^r1  +  . . .  is  convergent  for 
0  ^  r  <  1,  and  its  sum  is  not  greater  than  1/(1  —  r). 

12.  If  in  addition  the  series  ao  +  ai  +  a2  +  . . .  is  convergent,  then  the  series 
ao  +  air  +  a2r2  +  . . .  is  convergent  for  0  ^  r  ^  1,  and  its  sum  is  not  greater  than 
the  lesser  of  ao  +  ai  +  a2  +  . . .  and  1/(1  —  r). 

13.  The  series 

1  1  1 

lT  —  T - T - T  •  ■  ■ 

1  1-2  1-2-3 

is  convergent.  [For  1/(1  •  2  . . .  n)  T  l/2n_1.] 

14.  The  series 

1  1  |  1  i  1  |  1 

+  L2  +  l-2-3-4  +  “’’  1  +  1-2-3  +  1-2-3-4-5  +'" 

are  convergent  . 

15.  The  general  harmonic  series 

1  1  1 

- 1 - r  H - T7  T  •  •  •  , 

a  a  +  b  a  T  26 

where  a  and  6  are  positive,  diverges  to  Too. 

[For  un  =  l/(a  T  nb)  >  l/{n(a  T  6)}.  Now  compare  with  1  T  \  +  g  +  ■  •  •  •] 
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16.  Show  that  the  series 

(no  -  u\)  +  (m  -  n2)  +  (u2~u3)  +  ... 

is  convergent  if  and  only  if  un  tends  to  a  limit  as  n  — >  oo. 

17.  If  u\  +  U2  +  u3  + . . .  is  divergent  then  so  is  any  series  formed  by  grouping 
the  terms  in  brackets  in  any  way  to  form  new  single  terms. 

18.  Any  series,  formed  by  taking  a  selection  of  the  terms  of  a  convergent 
series  of  positive  terms,  is  itself  convergent. 


79.  The  representation  of  functions  of  a  continuous  real  vari¬ 
able  by  means  of  limits.  In  the  preceding  sections  we  have  frequently 
been  concerned  with  limits  such  as 

lim  4>n{x), 

n— >og 


and  series  such  as 


ui(x)  +  u2(x)  -t - =  lim  +  u2(x)  H - b  un(x)}, 

n— >oo 


in  which  the  function  of  n  whose  limit  we  are  seeking  involves,  besides  n, 
another  variable  x.  In  such  cases  the  limit  is  of  course  a  function  of  x. 
Thus  in  §  75  we  encountered  the  function 

f(x )  =  lim  n(\fx  —  1)  : 

n— >oo 

and  the  sum  of  the  geometrical  series  1  +  x  +  x2  +  . . .  is  a  function  of  x, 
viz.  the  function  which  is  equal  to  1/(1  —  x)  if— 1  <  x  <  1  and  is  undefined 
for  all  other  values  of  x. 

Many  of  the  apparently  ‘arbitrary’  or  ‘unnatural’  functions  considered 
in  Ch.  II  are  capable  of  a  simple  representation  of  this  kind,  as  will  appear 
from  the  following  examples. 
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Examples  XXXI.  1.  (j>n{x)  =  x.  Here  n  does  not  appear  at  all  in  the 

expression  of  4>n(x),  and  4>(x)  =  lim0re(x)  =  x  for  all  values  of  x. 

2.  4>n(x )  =  x/n.  Here  4>{x)  =  lim</>n(x)  =  0  for  all  values  of  x. 

3.  4>n{x)  =  nx.  If  x  >  0,  i i>n{x)  —>  +oo;  if  x  <  0,  </>n(x)  ~ >  — oo:  only  when 
x  =  0  has  < i>n{x )  a  limit  (viz.  0)  as  n  — >  oo.  Thus  </>(x)  =  0  when  x  =  0  and  is 
not  defined  for  any  other  value  of  x. 

4.  (j>n{x)  =  1/nx,  nx/(nx  +  1). 

5.  4>n(x)  =  xn.  Here  (j>(x)  =  0,  (—1  <  x  <  1);  (j>{x)  =  1,  (x  =  1);  and  (j>(x) 
is  not  defined  for  any  other  value  of  x. 

6.  4>n(x )  =  xn(l  —  x).  Here  <f>(x)  differs  from  the  4>(x)  of  Ex.  5  in  that  it 
has  the  value  0  when  x  =  1. 

7.  (j>n(x)  =  xn/n.  Here  <f>(x)  differs  from  the  4>(x)  of  Ex.  6  in  that  it  has 
the  value  0  when  x  =  —  1  as  well  as  when  x  =  1. 

8.  4>n{x)  =  xn/(xn  +  1).  [<f>(x)  =  0,  (-1  <  x  <  1);  4>(x)  =  1,  (x  =  1); 
(j>(x)  =  1,  (x  <  —  1  or  x  >  1);  and  (j)(x)  is  not  defined  when  x  =  —1.] 

9.  (/>n(x)  =  Xn/ ( Xn  —  1),  l/(xn  +  1),  l/(xn  —  1),  l/(xn  +  X~n),  1  /  (xn  —  X~n) . 

10.  (j>n(x)  =  (x11  —  l)/{xn  +  1),  (nxn  —  l)/(nxn  +  1),  (xn  —  n)/(xn  +  n).  [In 
the  first  case  4>{x)  =  1  when  |x|  >  1,  <j>{x)  =  —1  when  |x|  <  1,  cj)(x)  =  0  when 
x  =  1  and  <f>(x)  is  not  defined  when  x  =  —  1.  The  second  and  third  functions 
differ  from  the  first  in  that  they  are  defined  both  when  x  =  1  and  when  x  =  —  1: 
the  second  has  the  value  1  and  the  third  the  value  —1  for  both  these  values  of  x.\ 

11.  Construct  an  example  in  which  <j>{x)  =  1,  (|x|  >  1);  4>(x)  =  —1,  (|x|  <  1); 
and  4>{x)  =  0,  (x  =  1  and  x  =  —  1). 

12.  <j>n(x)  =  x{(x2n  —  l)/(x2?l  +  l)}2,  n/(xn  +  x~n  +  n). 

13.  4>n(x)  =  {xnf(x)  +  g{x)}/{xn  +  1).  [Here  (j)(x)  =  f(x),  (\x\  >  1); 
4>(x)  =  g(x),  (|x|  <  1);  4>(x)  =  \{f{x)  +  g(x)},  (x  =  1);  and  <j>{x)  is  undefined 
when  x  =  —  1.] 

14.  4>n{x)  =  (2/ir)  arctan(?zx).  [(j>{x)  =  1,  (x  >  0);  (j>(x)  =  0,  (x  =  0); 
4>(x)  =  —  1,  (x  <  0).  This  function  is  important  in  the  Theory  of  Numbers,  and 
is  usually  denoted  by  sgn  x.] 

15.  4>n(x)  =  sinnx7r.  \4>{x)  =  0  when  x  is  an  integer;  and  <j>(x)  is  otherwise 
undefined  (Ex.  XXIV.  7).] 

16.  If  4>n(x )  =  sin(?r!x7r)  then  4>(x)  =  0  for  all  rational  values  of  x 
(Ex.  XXIV.  14).  [The  consideration  of  irrational  values  presents  greater  difficul- 
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17.  4>n(x)  =  (cos* 1 2  xir)n.  \(j){x )  =  0  except  when  x  is  integral,  when  cj>(x)  =  1.] 

18.  If  N  ^  1752  then  the  number  of  days  in  the  year  N  A.D.  is 

lim{365  +  (cos2  \Nt\ r)ra  —  (cos2  ^ Nir)n  +  (cos2  ^ Nir)n}. 


80.  The  bounds  of  a  bounded  aggregate.  Let  S  be  any  system  or 
aggregate  of  real  numbers  s.  If  there  is  a  number  K  such  that  s  ^  K  for  every  s 
of  S,  we  say  that  S  is  bounded  above.  If  there  is  a  number  k  such  that  s  ^  k 
for  every  s,  we  say  that  S  is  bounded  below.  If  S  is  both  bounded  above  and 
bounded  below,  we  say  simply  that  S  is  bounded. 

Suppose  first  that  S  is  bounded  above  (but  not  necessarily  below).  There 
will  be  an  infinity  of  numbers  which  possess  the  property  possessed  by  K\  any 
number  greater  than  K,  for  example,  possesses  it.  We  shall  prove  that  among 
these  numbers  there  is  a  least*  which  we  shall  call  M.  This  number  M  is  not 
exceeded  by  any  member  of  S,  but  every  number  less  than  M  is  exceeded  by  at 
least  one  member  of  S. 

We  divide  the  real  numbers  £  into  two  classes  L  and  R,  putting  £  into  L  or  R 
according  as  it  is  or  is  not  exceeded  by  members  of  S.  Then  every  £  belongs  to 
one  and  one  only  of  the  classes  L  and  R.  Each  class  exists;  for  any  number  less 
than  any  member  of  S  belongs  to  L,  while  K  belongs  to  R.  Finally,  any  member 
of  L  is  less  than  some  member  of  S,  and  therefore  less  than  any  member  of  R. 
Thus  the  three  conditions  of  Dedekind’s  Theorem  (§  17)  are  satisfied,  and  there 
is  a  number  M  dividing  the  classes. 

The  number  M  is  the  number  whose  existence  we  had  to  prove.  In  the  first 
place,  M  cannot  be  exceeded  by  any  member  of  S.  For  if  there  were  such  a 
member  s  of  S,  we  could  write  s  =  M  +  rj,  where  g  is  positive.  The  number 
M  +  Ir/  would  then  belong  to  L,  because  it  is  less  than  s,  and  to  R,  because  it 
is  greater  than  M;  and  this  is  impossible.  On  the  other  hand,  any  number  less 

*An  infinite  aggregate  of  numbers  does  not  necessarily  possess  a  least  member.  The 
set  consisting  of  the  numbers 

1  -  -  - 

2  3  n 

for  example,  has  no  least  member. 
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than  M  belongs  to  L,  and  is  therefore  exceeded  by  at  least  one  member  of  S. 
Thus  M  has  all  the  properties  required. 

This  number  M  we  call  the  upper  bound  of  S,  and  we  may  enunciate  the  fol¬ 
lowing  theorem.  Any  aggregate  S  which  is  bounded  above  has  an  upper  bound  M. 
No  member  of  S  exceeds  M;  but  any  number  less  than  M  is  exceeded  by  at  least 
one  member  of  S. 

In  exactly  the  same  way  we  can  prove  the  corresponding  theorem  for  an 
aggregate  bounded  below  (but  not  necessarily  above) .  Any  aggregate  S  which  is 
bounded  below  has  a  lower  bound  m.  No  member  of  S  is  less  than  m;  but  there 
is  at  least  one  member  of  S  which  is  less  than  any  number  greater  than  rn. 

It  will  be  observed  that,  when  S  is  bounded  above,  M  ff  K,  and  when  S  is 
bounded  below,  m  ^  k.  When  S  is  bounded,  k  ^  m  ^  M  ^  K . 

81.  The  bounds  of  a  bounded  function.  Suppose  that  f(n)  is  a 
function  of  the  positive  integral  variable  n.  The  aggregate  of  all  the  values  <f>(n) 
defines  a  set  S,  to  which  we  may  apply  all  the  arguments  of  §  80.  If  S  is  bounded 
above,  or  bounded  below,  or  bounded,  we  say  that  4>{n)  is  bounded  above,  or 
bounded  below,  or  bounded.  If  <f>(n)  is  bounded  above,  that  is  to  say  if  there  is 
a  number  K  such  that  4>(n)  ^  K  for  all  values  of  n,  then  there  is  a  number  M 
such  that 

(i)  4>(n)  ^  M  for  all  values  of  n; 

(ii)  if  e  is  any  positive  number  then  (f{n)  >  M  —  e  for  at  least  one  value 
of  n.  This  number  M  we  call  the  upper  bound  of  4>(n).  Similarly,  if  <f(n)  is 
bounded  below,  that  is  to  say  if  there  is  a  number  k  such  that  (f{n)  ^  k  for  all 
values  of  n,  then  there  is  a  number  m  such  that 

(i)  4>(n)  ^  m  for  all  values  of  n; 

(ii)  if  e  is  any  positive  number  then  <j>(n)  <  m  +  e  for  at  least  one  value 
of  n.  This  number  m  we  call  the  lower  bound  of  4>(n). 

If  K  exists,  M  ^  K]  if  k  exists,  m  ^  k]  and  if  both  k  and  K  exist  then 

kSm^M  S  K. 

82.  The  limits  of  indetermination  of  a  bounded  function.  Sup¬ 
pose  that  f{n)  is  a  bounded  function,  and  M  and  m  its  upper  and  lower  bounds. 
Let  us  take  any  real  number  £,  and  consider  now  the  relations  of  inequality  which 
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may  hold  between  £  and  the  values  assumed  by  <f{n)  for  large  values  of  n.  There 
are  three  mutually  exclusive  possibilities: 

(1)  £  =  (l)(n)  f°r  all  sufficiently  large  values  of  n; 

(2)  £  ^  (f>(n)  for  all  sufficiently  large  values  of  n; 

(3)  £  <  <j>(n)  for  an  infinity  of  values  of  n,  and  also  £  >  (f>(n)  for  an  infinity 
of  values  of  n. 

In  case  (1)  we  shall  say  that  £  is  a  superior  number,  in  case  (2)  that  it  is  an 
inferior  number,  and  in  case  (3)  that  it  is  an  intermediate  number.  It  is  plain 
that  no  superior  number  can  be  less  than  m,  and  no  inferior  number  greater 
than  M. 

Let  us  consider  the  aggregate  of  all  superior  numbers.  It  is  bounded  below, 
since  none  of  its  members  are  less  than  m,  and  has  therefore  a  lower  bound, 
which  we  shall  denote  by  A.  Similarly  the  aggregate  of  inferior  numbers  has  an 
upper  bound,  which  we  denote  by  A. 

We  call  A  and  A  respectively  the  upper  and  lower  limits  of  indetermination 
of  f>{n)  as  n  tends  to  infinity,  and  write 

A  =  lim^(n),  A  =  lim</>(n). 

These  numbers  have  the  following  properties: 

(1)  mS  A  ^  A  ^  M; 

(2)  A  and  A  are  the  upper  and  lower  bounds  of  the  aggregate  of  intermediate 
numbers,  if  any  such  exist; 

(3)  if  e  is  any  positive  number,  then  £>(n)  <  A  +  e  for  all  sufficiently  large 
values  of  n,  and  <f>(n)  >  A  —  e  for  an  infinity  of  values  of  n; 

(4)  similarly  <j>(n)  >  A  —  e  for  all  sufficiently  large  values  of  n,  and 
4>(n)  <  A  +  e  for  an  infinity  of  values  of  n; 

(5)  the  necessary  and  sufficient  condition  that  fin)  should  tend  to  a  limit 
is  that  A  =  A,  and  in  this  case  the  limit  is  l,  the  common  value  of  A  and  A. 

Of  these  properties,  (1)  is  an  immediate  consequence  of  the  definitions;  and 
we  can  prove  (2)  as  follows.  If  A  =  A  =  l,  there  can  be  at  most  one  intermediate 
number,  viz.  I,  and  there  is  nothing  to  prove.  Suppose  then  that  A  >  A.  Any 
intermediate  number  £  is  less  than  any  superior  and  greater  than  any  inferior 
number,  so  that  A  ^  £  ^  A.  But  if  A  <£<  A  then  £  must  be  intermediate,  since 
it  is  plainly  neither  superior  nor  inferior.  Hence  there  are  intermediate  numbers 
as  near  as  we  please  to  either  A  or  A. 
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To  prove  (3)  we  observe  that  A  +  e  is  superior  and  A  —  e  intermediate  or 
inferior.  The  result  is  then  an  immediate  consequence  of  the  definitions;  and  the 
proof  of  (4)  is  substantially  the  same. 

Finally  (5)  may  be  proved  as  follows.  If  A  =  A  =  l,  then 

l  —  e  <  4>(n)  <  l  +  e 

for  every  positive  value  of  e  and  all  sufficiently  large  values  of  n,  so  that  (j>(n)  — > 

l.  Conversely,  if  <fi(n)  —>  l,  then  the  inequalities  above  written  hold  for  all 
sufficiently  large  values  of  n.  Hence  l  —  e  is  inferior  and  l  +  e  superior,  so  that 

X^l-e,  A  ^  l  +  e, 

and  therefore  A  —  A  ^  2e.  As  A  —  A  ^  0,  this  can  only  be  true  if  A  =  A. 

Examples  XXXII.  1.  Neither  A  nor  A  is  affected  by  any  alteration  in 
any  finite  number  of  values  of  <j>(n). 

2.  If  4>(n)  =  a  for  all  values  of  n,  then  m  =  A  =  A  =  M  =  a. 

3.  If  4>(n)  =  1/n,  then  m  =  A  =  A  =  0  and  M  =  1. 

4.  If  </>(n)  =  (— l)n,  then  m  =  A  =  —  1  and  A  =  M  =  1. 

5.  If  4>{n)  =  (— 1  )n/n,  then  rn  =  — 1,  A  =  A  =  0,  M  = 

6.  If  (j){n)  =  (— l)n{l  +  (1/n)},  then  m  =  —2,  A  =  —1,  A  =  1,  M  =  §. 

7.  Let  <f>(n)  =  sin  ndir.  where  6  >  0.  If  6  is  an  integer  then  m  =  A  =  A  = 

M  =  0.  If  6  is  rational  but  not  integral  a  variety  of  cases  arise.  Suppose,  e.g., 
that  9  =  p/q,  p  and  q  being  positive,  odd,  and  prime  to  one  another,  and  q  >  1. 
Then  4>(n)  assumes  the  cyclical  sequence  of  values 

sin(p7r/g),  sin(2p7r/g),  ...,  sin{(2g  —  l)pir/q},  sm(2qpir/q),  .... 

It  is  easily  verified  that  the  numerically  greatest  and  least  values  of  (f>(n)  are 
cos(7t/2 q)  and  —  cos(7r/2 q),  so  that 

m  =  A  =  —  cos(7t/2  q),  A  =  M  =  cos(7r/2g). 

The  reader  may  discuss  similarly  the  cases  which  arise  when  p  and  q  are  not 
both  odd. 
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The  case  in  which  6  is  irrational  is  more  difficult:  it  may  be  shown  that  in 
this  case  m  =  A  =  —1  and  A  =  M  =  1.  It  may  also  be  shown  that  the  values 
of  4>(n)  are  scattered  all  over  the  interval  [—1, 1]  in  such  a  way  that,  if  £  is  any 
number  of  the  interval,  then  there  is  a  sequence  n\,  ri2,  ■  ■  ■  such  that  <j){nk)  — >  £ 
as  k  — >  oo* 

The  results  are  very  similar  when  4>(n)  is  the  fractional  part  of  nO. 

83.  The  general  principle  of  convergence  for  a  bounded  func¬ 
tion.  The  results  of  the  preceding  sections  enable  us  to  formulate  a  very  im¬ 
portant  necessary  and  sufficient  condition  that  a  bounded  function  <p(n)  should 
tend  to  a  limit,  a  condition  usually  referred  to  as  the  general  principle  of  con¬ 
vergence  to  a  limit. 

Theorem  1 .  The  necessary  and  sufficient  condition  that  a  bounded  func¬ 
tion  (j>(n)  should  tend  to  a  limit  is  that,  when  any  positive  number  e  is  given,  it 
should  be  possible  to  find  a  number  no(e)  such  that 

\<j}(n2)  -  <Xm)|  <  e 

for  all  values  of  n\  and  n2  such  that  ri2  >  n\  ^  no(e). 

In  the  first  place,  the  condition  is  necessary.  For  if  <f(n)  — >  l  then  we  can 
find  no  so  that 

l  —  \e  <  4>(n)  <1  +  \e 

when  n  ^  no,  and  so 

\(t>{n2)  -  f{ni)\  <  e  (1) 

when  ni  ^  no  and  n2  ^  no- 

In  the  second  place,  the  condition  is  sufficient.  In  order  to  prove  this  we 
have  only  to  show  that  it  involves  A  =  A.  But  if  A  <  A  their  there  are,  however 
small  e  may  be,  infinitely  many  values  of  n  such  that  <f{n)  <  A  +  e  and  infinitely 
many  such  that  </>(n)  >  A  —  e;  and  therefore  we  can  find  values  of  ni  and  n2, 
each  greater  than  any  assigned  number  no,  and  such  that 

0(n2)  -  4>{ni)  >  A  -  A  -  2e, 

which  is  greater  than  ^(A  —  A)  if  e  is  small  enough.  This  plainly  contradicts  the 
inequality  (1).  Hence  A  =  A,  and  so  4>(n)  tends  to  a  limit. 

*  A  number  of  simple  proofs  of  this  result  are  given  by  Hardy  and  Littlewood,  “Some 
Problems  of  Diophantine  Approximation” ,  Acta  Mathematica ,  vol.  xxxvii. 
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84.  Unbounded  functions.  So  far  we  have  restricted  ourselves  to 
bounded  functions;  but  the  ‘general  principle  of  convergence’  is  the  same  for 
unbounded  as  for  bounded  functions,  and  the  words  ‘a  bounded  function ’  may 
be  omitted  from  the  enunciation  of  Theorem  1. 

In  the  first  place,  if  tends  to  a  limit  l  then  it  is  certainly  bounded;  for 
all  but  a  finite  number  of  its  values  are  less  than  l  +  e  and  greater  than  l  —  e. 

In  the  second  place,  if  the  condition  of  Theorem  1  is  satisfied,  we  have 

\(f(n2)  -  4>(ni)\  <  e 

whenever  n\  ^  no  and  n2  ^  no-  Let  us  choose  some  particular  value  ni  greater 
than  no-  Then 

<t>{nf)  -  e<  fi(n2)  <  +  e 

when  n2  =  no-  Hence  4>{n)  is  bounded;  and  so  the  second  part  of  the  proof  of 
the  last  section  applies  also. 

The  theoretical  importance  of  the  ‘general  principle  of  convergence’  can 
hardly  be  overestimated.  Like  the  theorems  of  §  69,  it  gives  us  a  means  of 
deciding  whether  a  function  c />(n )  tends  to  a  limit  or  not,  without  requiring  us 
to  be  able  to  tell  beforehand  what  the  limit,  if  it  exists,  must  be;  and  it  has 
not  the  limitations  inevitable  in  theorems  of  such  a  special  character  as  those  of 
§  69.  But  in  elementary  work  it  is  generally  possible  to  dispense  with  it,  and  to 
obtain  all  we  want  from  these  special  theorems.  And  it  will  be  found  that,  in 
spite  of  the  importance  of  the  principle,  practically  no  applications  are  made  of 
it  in  the  chapters  which  follow.*  We  will  only  remark  that,  if  we  suppose  that 

4>(n)  =  sn  =  m  +  u2  H - b  un, 

we  obtain  at  once  a  necessary  and  sufficient  condition  for  the  convergence  of  an 
infinite  series,  viz: 

Theorem  2.  The  necessary  and  sufficient  condition  for  the  convergence  of 
the  series  u\  +  u2  +  •  • .  is  that,  given  any  positive  number  e,  it  should  be  possible 
to  find  no  so  that 

I'Mni+l  +  un  1+2  +  •  •  •  +  Un2  \  <  € 

for  all  values  of  n\  and  n2  such  that  n2  >  n\  ^  no- 


A  few  proofs  given  in  Ch.  VIII  can  be  simplified  by  the  use  of  the  principle. 
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85.  Limits  of  complex  functions  and  series  of  complex  terms. 

In  this  chapter  we  have,  up  to  the  present,  concerned  ourselves  only  with 
real  functions  of  n  and  series  all  of  whose  terms  are  real.  There  is  however 
no  difficulty  in  extending  our  ideas  and  definitions  to  the  case  in  which  the 
functions  or  the  terms  of  the  series  are  complex. 

Suppose  that  (f>(n)  is  complex  and  equal  to 

p(n)  +  ia(n), 

where  p(n),  a(n)  are  real  functions  of  n.  Then  if  p(n)  and  a(n)  converge 
respectively  to  limits  r  and  s  as  n  -A  oo,  we  shall  say  that  <ffn)  converges 
to  the  limit  l  —  r  +  is,  and  write 

lim  4>{n)  =  l. 

Similarly,  when  un  is  complex  and  equal  to  vn  +  iwn,  we  shall  say  that  the 
series 

Ul  +  u2  +  u3  +  . . . 

is  convergent  and  has  the  sum  l  —  r  +  is,  if  the  series 


V1+V2  +  V3  + W1+W2+W3  +  ... 

are  convergent  and  have  the  sums  r,  s  respectively. 

To  say  that  u\  +  u2  +  u3  +  . . .  is  convergent  and  has  the  sum  l  is  of 
course  the  same  as  to  say  that  the  sum 

Sn  =  Ul  +  u2  H - \-un  =  (t>i  +  v2  4 - h  vn)  +  i(w  1  +  w2  H - h  wn) 

converges  to  the  limit  l  as  n  00. 

In  the  case  of  real  functions  and  series  we  also  gave  definitions  of  di¬ 
vergence  and  oscillation,  finite  or  infinite.  But  in  the  case  of  complex 
functions  and  series,  where  we  have  to  consider  the  behaviour  both  of  p(n) 
and  of  a(n),  there  are  so  many  possibilities  that  this  is  hardly  worth  while. 
When  it  is  necessary  to  make  further  distinctions  of  this  kind,  we  shall 
make  them  by  stating  the  way  in  which  the  real  or  imaginary  parts  behave 
when  taken  separately. 
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86.  The  reader  will  find  no  difficulty  in  proving  such  theorems  as  the 
following,  which  are  obvious  extensions  of  theorems  already  proved  for  real 
functions  and  series. 

(1)  If  lim  (f>(n)  =  l  then  lim  <j){n  +  p)  —  l  for  any  fixed  value  of  p. 

(2)  If  u\  +  u2  +  ■  ■  .  is  convergent  and  has  the  sum  /,  then  a  +  b  +  c  + 

- b  k  +  u\  +  u2  +  . . .  is  convergent  and  has  the  sum  a  +  b  +  c-\ - b  k  +  l, 

and  up+ 1  +  up+2  +  . . .  is  convergent  and  has  the  sum  l  —  u\  —  u2 - —  up. 

(3)  If  lim0(n)  =  l  and  lim^(n)  =  m,  then 

lim{0(n)  +  ip(n)}  =  /  +  m. 


(4)  If  lim0(n)  =  /,  then  lim  h(f)(n)  =  kl. 

(5)  If  lim  d(n)  =  l  and  lim =  m,  then  lim 0(n)^(n)  =  Im. 

(6)  If  ui  +  U2  +  ■  ■  ■  converges  to  the  sum  /,  and  v\  +  V2  +  . . .  to  the 
sum  m,  then  (ui  +  Ui)  +  (u2  +  v2)  +  . . .  converges  to  the  sum  l  +  m. 

(7)  If  u\ +u2  + . . .  converges  to  the  sum  l  then  ku±  +  ku2  + . . .  converges 
to  the  sum  kl. 

(8)  If  Ui  +  u2  +  u3  +  . . .  is  convergent  then  lim  un  =  0. 

(9)  If  Mi  +  u2  +  «3  +  •  •  •  is  convergent,  then  so  is  any  series  formed 
by  grouping  the  terms  in  brackets,  and  the  sums  of  the  two  series  are  the 
same. 

As  an  example,  let  us  prove  theorem  (5).  Let 

4>(n)  =  p(n )  +  ia(n),  ip(n)  =  p(n)  +  ia'(n ),  l  =  r  +  is,  m  =  r  +  is  . 


Then 

But 


pin)  r,  cr(n)  — >  s,  p(n)  — >  r' ,  a'( n )  — >  s'. 
c/>(n)V’(n)  =  pf>  -  a  o'  +  i(pa'  +  p'a), 

/  r  .  i  /  .  t.i 

pp  —  aa  — >  rr  —  ss  ,  per  +  p  a  — >  rs  +  r  s; 


so  that 


< f>(n)i/>(n )  — >•  rr'  —  ss'  +  i(rs  +  r's), 
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i.e. 


4>(n)if(n)  — >  (r  +  is)(r'  +  is')  =  Im. 


The  following  theorems  are  of  a  somewhat  different  character. 

(10)  In  order  that  <f{n )  =  p{n)+ia(n)  should  converge  to  zero  as  n  — »  oo, 
it  is  necessary  and  sufficient  that 

W«)l  =  \/WVP  +  VRF 

should  converge  to  zero. 

If  p(n)  and  a(n)  both  converge  to  zero  then  it  is  plain  that  y /p2  +  a2  does 
so.  The  converse  follows  from  the  fact  that  the  numerical  value  of  p  or  a  cannot 
be  greater  than  \J p2  +  a2. 

(11)  More  generally,  in  order  that  <fi(n)  should  converge  to  a  limit  l,  it 
is  necessary  and  sufficient  that 


\<Kn)-l\ 


should  converge  to  zero. 

For  <f{n)  —  l  converges  to  zero,  and  we  can  apply  (10). 

(12)  Theorems  1  and  2  of  §§  83-84  are  still  true  when  <f(n)  and  un  are 
complex. 

We  have  to  show  that  the  necessary  and  sufficient  condition  that  4>(n)  should 
tend  to  l  is  that 

\f(n2)  -  <Xni)|  <  e  (1) 

when  n2  >  n±  ^  no- 

If  4>{n)  — >  l  then  p(n)  r  and  a (n)  — >  s,  and  so  we  can  find  numbers  n'0 
and  t?.q  depending  on  e  and  such  that 

|p(n2)  -  p{ni)\  <  4e,  |cr(n2)  -  cr(ni)|  <  |e, 

the  first  inequality  holding  when  n2  >  n\  ^  n'0,  and  the  second  when 
ri2  >  n\  ^  ti'q  .  Hence 

| ffm)  -  <t>{n{)\  ^  |p(n2)  -  p(ni)\  +  |<r(n2)  -  cr(ni)|  <  e 
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when  n2  >  n\  ^  no,  where  no  is  the  greater  of  n'0  and  u'q.  Thus  the  condition  (1) 
is  necessary.  To  prove  that  it  is  sufficient  we  have  only  to  observe  that 

I p{n2)  -  p(ni)|  ^  \<t>(n2)  -  <t>{n{)\  <  e 

when  n2  >  n\  ^  uq.  Thus  p{n)  tends  to  a  limit  r,  and  in  the  same  way  it  may 
be  shown  that  <r(n)  tends  to  a  limit  s. 

87.  The  limit  of  zn  as  n  — >  oo,  z  being  any  complex  number. 

Let  us  consider  the  important  case  in  which  4>{n)  =  zn.  This  problem  has 
already  been  discussed  for  real  values  of  2  in  §  72. 

If  zn  — »  l  then  zn+1  — >  l ,  by  (1)  of  §  86.  But,  by  (4)  of  §  86, 

zn+1  =  zz11  ->•  zl, 

and  therefore  l  =  zl ,  which  is  only  possible  if  (a)  l  =  0  or  (b)  z  =  1.  If 
z  —  1  then  limz"  =  1.  Apart  from  this  special  case  the  limit,  if  it  exists, 
can  only  be  zero. 

Now  if  z  —  r(cos0  +  % sin0),  where  r  is  positive,  then 
zn  =  rn  (cos  nO  +  i  sin  n9), 

so  that  \zn\  =  rn.  Thus  \zn\  tends  to  zero  if  and  only  if  r  <  1;  and  it  follows 
from  (10)  of  §  86  that 

lim  zn  =  0 

if  and  only  if  r  <  1.  In  no  other  case  does  zn  converge  to  a  limit,  except 
when  z  —  1  and  zn  — >  1. 

88.  The  geometric  series  1  +  z  +  z2  +  . . .  when  z  is  complex. 

Since 

sn  =  1  +  z  +  +  •  •  •  +  V1-1  =  (1  -  zn)/(  1  -  z), 

unless  z  —  1,  when  the  value  of  sn  is  n,  it  follows  that  the  series 
1  +  z  +  z1  +  . . .  is  convergent  if  and  only  if  r  —  \z\  <  1.  And  its  sum  when 
convergent  is  1/(1  —  z). 
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Thus  if  z  =  r(cos  9  +  i  sin  6)  =  r  Cis  9,  and  r  <  1,  we  have 
l  +  z  +  z2  +  ...  =  1/(1  —  r  Cis  9), 


or 


1  +  r  Cis  9  +  r2  Cis  26  +  ...  —  1/(1  —  r  Cis  9) 

=  (1  —  rcos9  +  ir  sin  9) /(I  —  2rcos9  +  r2). 


Separating  the  real  and  imaginary  parts,  we  obtain 

1  +  rcos#  +  r2  cos  29  +  . . .  =  (1  —  rcos9)/(l  —  2r  cos  9  +  r2), 

r  sin  9  +  r2  sin  29  +  . . .  =  r  sin  9/ (1  —  2 r  cos  9  +  r2), 

provided  r  <  1.  If  we  change  9  into  9  +  tt,  we  see  that  these  results  hold 
also  for  negative  values  of  r  numerically  less  than  1.  Thus  they  hold  when 
-1  <  r  <  1. 

Examples  XXXIII.  1.  Prove  directly  that  4>(n)  =  rn  cos  nO  converges 
to  0  when  r  <  1  and  to  1  when  r  =  1  and  9  is  a  multiple  of  2vr.  Prove  further 
that  if  r  =  1  and  9  is  not  a  multiple  of  2ir,  then  4>(n)  oscillates  finitely;  if  r  >  1 

and  9  is  a  multiple  of  2-7T,  then  4>(n)  — >  +oo;  and  if  r  >  1  and  9  is  not  a  multiple 

of  27t,  then  (j)(n)  oscillates  infinitely. 

2.  Establish  a  similar  series  of  results  for  <j>(n)  =  rn  sin  n9. 

3.  Prove  that 


zm  +  zm+ 1  +  .  .  .  =  zm/^  _ 


zm  +  2zm+l  +  2zm+ 2  +  .  .  .  =  zm^  +  zy  q 

if  and  only  if  \z\  <  1.  Which  of  the  theorems  of  §  86  do  you  use? 

4.  Prove  that  if  —  1  <  r  <  1  then 

1  +  2r  cos  9  +  2 r2  cos  29  +  •  •  •  =  (1  —  r2) /(I  —  2 r  cos  9  +  r2). 

5.  The  series 


1  +  z  if  \z/(l  +  z)\  <  1.  Show  that  this 


condition  is  equivalent  to  the  condition  that  z  has  a  real  part  greater  than  —  \. 
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MISCELLANEOUS  EXAMPLES  ON  CHAPTER  IV. 

1.  The  function  4>(n)  takes  the  values  1,  0,  0,  0,  1,  0,  0,  0,  1,  . . .  when 
n  =  0,  1,2,....  Express  (j)(n)  in  terms  of  n  by  a  formula  which  does  not  involve 
trigonometrical  functions.  [<f>(n)  =  |{1  +  (—  l)n  +  in  +  (— i)n}.] 

2.  If  <j){n)  steadily  increases,  and  fi>(n)  steadily  decreases,  as  n  tends  to  oo, 
and  if  fi>(n)  >  <j>{n)  for  all  values  of  n,  then  both  <j>(n)  and  fij( n )  tend  to  limits, 
and  lirri  <p(n)  ^  lim^(n).  [This  is  an  immediate  corollary  from  §  69.] 

3.  Prove  that,  if 


<Rn)  =  1  +  - 

n 


V’(n)  =1 - 

n 


then  cj)(n  +  1)  >  (j>(n)  and  ip(n  +  1)  <  'ip(n).  [The  first  result  has  already  been 
proved  in  §  73.] 

4.  Prove  also  that  ip(n)  >  4>(n)  for  all  values  of  n:  and  deduce  (by  means 
of  the  preceding  examples)  that  both  </>(n)  and  r/(n)  tend  to  limits  as  n  tends 
to  oo.* 

5.  The  arithmetic  mean  of  the  products  of  all  distinct  pairs  of  positive 
integers  whose  sum  is  n  is  denoted  by  Sn.  Show  that  lim(<S'n/n2)  =  1/6. 

(Math.  Trip.  1903.) 

6.  Prove  that  if  x\  =  x  +  (A/x)},  X2  =  \{x\  +  (A/x i)},  and  so  on, 
x  and  A  being  positive,  then  lirri  xn  =  \[A. 

r  „  ,  Xn  ~  VA  ( X  —  \[AS 

Prove  first  that  - 7=  =  - -= 

xn  +  \/A  \x  +  VAy 

7.  If  4>(n)  is  a  positive  integer  for  all  values  of  n,  and  tends  to  00  with  n, 
then  aAn)  tends  to  0  if  0  <  x  <  1  and  to  +00  if  x  >  1 .  Discuss  the  behaviour 
of  x^n\  asn-l  00,  for  other  values  of  x. 

8.1  If  an  increases  or  decreases  steadily  as  n  increases,  then  the  same  is  true 
of  (ai  +  a2  H - b  an)/n. 

9.  If  xn+\  =  y/k  +  xn,  and  k  and  x\  are  positive,  then  the  sequence  x\,  x'2, 
X3,  ...  is  an  increasing  or  decreasing  sequence  according  as  xi  is  less  than  or 


*A  proof  that  lim (V’(n)  —  (f>(n)}  =  0,  and  that  therefore  each  function  tends  to  the 
limit  e,  will  be  found  in  Chrystal’s  Algebra ,  vol.  ii,  p.  78.  We  shall  however  prove  this 
in  Ch.  IX  by  a  different  method. 

fExs.  8-12  are  taken  from  Bromwich’s  Infinite  Series. 
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greater  than  a,  the  positive  root  of  the  equation  x2  =  x  +  k;  and  in  either  case 
xn  — y  ol  as  n  — y  oo. 

10.  If  Xn+i  =  k/(  1  +  xn),  and  k  and  x\  are  positive,  then  the  sequences 
xi,  X3,  X5,  . . .  and  X2,  X4,  xq,  . . .  are  one  an  increasing  and  the  other  a  decreas¬ 
ing  sequence,  and  each  sequence  tends  to  the  limit  ct,  the  positive  root  of  the 
equation  x2  +  x  =  k. 

11.  The  function  f(x )  is  increasing  and  continuous  (see  Ch.  V)  for  all  values 
of  x,  and  a  sequence  xi,  X2,  X3,  . . .  is  defined  by  the  equation  xn+\  =  f(xn). 
Discuss  on  general  graphical  grounds  the  question  as  to  whether  xn  tends  to  a 
root  of  the  equation  x  =  f(x).  Consider  in  particular  the  case  in  which  this 
equation  has  only  one  root,  distinguishing  the  cases  in  which  the  curve  y  =  /(x) 
crosses  the  line  y  =  x  from  above  to  below  and  from  below  to  above. 

12.  If  x\,  X2  are  positive  and  xn+\  =  ^(xn+xn_ 1),  then  the  sequences  xi,  X3, 
X5,  ...  and  x’2,  X4,  xg,  ...  are  one  a  decreasing  and  the  other  an  increasing 
sequence,  and  they  have  the  common  limit  |(xi  +  2x2). 

13.  Draw  a  graph  of  the  function  y  defined  by  the  equation 

x2n  sin  kx  +  x2 

y  =  lim - 2m — • 

n— >00  x^1'1  +  1 


(Math.  Trip.  1901.) 

14.  The  function 

V  =  lim  - k - 

n  — >00  1  +  n  sin  7rx 

is  equal  to  0  except  when  x  is  an  integer,  and  then  equal  to  1.  The  function 


,  D(x)  +  ncj)(x)  sin2  7rx 

y  =  lim  - g - 

rw-oo  1  +  n  sill  7TX 


is  equal  to  4>(x)  unless  x  is  an  integer,  and  then  equal  to  i^(x). 

15.  Show  that  the  graph  of  the  function 


,  xn4>(x )  +  x  n^(x ) 

y  =  Inn  - 

n— loo  xn  +  X  n 


is  composed  of  parts  of  the  graphs  of  4>(x)  and  i/j(x ),  together  with  (as  a  rule) 
two  isolated  points.  Is  y  defined  when  (a)  x  =  1,  (b)  x  =  —1,  (c)  x  =  0? 
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16.  Prove  that  the  function  y  which  is  equal  to  0  when  x  is  rational,  and 
to  1  when  x  is  irrational,  may  be  represented  in  the  form 

y  =  lim  sgn {sin2 (ml  ir x)}, 

m— >oo 


where 

sgn  x  =  lim  (2/ 7r)  arctan(?rx), 

n— >oc 

as  in  Ex.  XXXI.  14.  [If  x  is  rational  then  sin2(m!7rx),  and  therefore 
sgn{sin2(m! 7rx)},  is  equal  to  zero  from  a  certain  value  of  m  onwards:  if 
x  is  irrational  then  sin2(m!  7rx)  is  always  positive,  and  so  sgn{sin2(m!  7rx)}  is 
always  equal  to  1.] 

Prove  that  y  may  also  be  represented  in  the  form 


1—  lim  [  lim  {cos(m!  7rx)}2”]. 
m—>oon—>oo 


17.  Sum  the  series 


OO^OO  -y 

EE^  +  i)’  E  v(v  +  1) . . .  (v  +  k) ' 


[Since 


1 


1 


1 


1 


v(u  +  1) . . .  (i '  +  k)  k  \  +  1) . . .  (v  +  k  —  1)  (v  +  l)(v  +  2) . . .  (u  +  k)  J  ’ 

we  have 

1  1  f  1  1 


E 


v(v  +  1) . . .  (v  +  k)  k  \  1  •  2  . . .  k  (n  +  l)(n  +  2) . . .  (n  +  k) 


and  so 


E 


u(u  +  1) . . .  (v  +  k)  k(k\) ' 


18.  If  \z\  <  led,  then 


L  Lf  z  z2 

- — - (  n — I — ^  + 

z  —  a  a  V  ol  ar 
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and  if  \z\  >  |a|,  then 


L 


z  —  a 


19.  Expansion  of  (Az  +  B)/(az2  +  2 bz  +  c)  in  powers  of  z.  Let  a,  (3  be 
the  roots  of  az 2  +  2 bz  +  c  =  0,  so  that  az 2  +  2 bz  +  c  =  a(z  —  a)(z  —  f3).  We  shall 
suppose  that  A,  B,  a,  b,  c  are  all  real,  and  a  and  (3  unequal.  It  is  then  easy  to 
verify  that 


Az  +  B  1  /  Aa  +  B  A/3  +  B\ 

az2  +  2 bz  +  c  a(a  —  /3)  \  z  —  a  z  —  (3  J 

There  are  two  cases,  according  as  b2  >  ac  or  b2  <  ac. 

(1)  If  b2  >  ac  then  the  roots  a,  f3  are  real  and  distinct.  If  \z\  is  less  than 
either  |a|  or  \/3\  we  can  expand  1  /{z  —  a)  and  l/(z  —  (3)  in  ascending  powers  of  z 
(Ex.  18).  If  |z|  is  greater  than  either  |a|  or  |/3|  we  must  expand  in  descending 
powers  of  z\  while  if  \z\  lies  between  |a|  and  |/3|  one  fraction  must  be  expanded 
in  ascending  and  one  in  descending  powers  of  z.  The  reader  should  write  down 
the  actual  results.  If  \z\  is  equal  to  |a|  or  \/3\  then  no  such  expansion  is  possible. 

(2)  If  b2  <  ac  then  the  roots  are  conjugate  complex  numbers  (Ch.  Ill  §  43), 
and  we  can  write 

a  =  pCis(j),  (3  =  p  Cis(— cf>), 

where  p2  =  a/3  =  c/a,  pcoscf)  =  |(a  +  (3)  =  —b/a,  so  that  cos  (j>  =  — \Jb2  jac , 
sin  (f)  =  y/l  —  ( b2/ac ). 

If  \z\  <  p  then  each  fraction  may  be  expanded  in  ascending  powers  of  z.  The 
coefficient  of  zn  will  be  found  to  be 

Apsinncj)  +  B  sin{(n  +  1  )</)} 
apn+1  sin  <j) 

If  \z\  >  p  we  obtain  a  similar  expansion  in  descending  powers,  while  if  \z\  =  p 
no  such  expansion  is  possible. 

20.  Show  that  if  \z\  <  1  then 


1  +  2z  +  3z2  +  •  •  •  +  (n  +  1  )zn  +  •  •  •  =  1/(1  -  z)2. 
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\  —  zn  nzn 

[The  sum  to  n  terms  is  - - -k - .] 

(1  —  z)z  1  —  2 

21.  Expand  LI (z  —  a)2  in  powers  of  2,  ascending  or  descending  according  as 
\z\  <  |a|  or  1 2|  >  |a|. 

22.  Show  that  if  b2  =  ac  and  |a2|  <  |6|  then 

Az  +  B _ =  y' 

az 2  +  2 bz  +  c  n 

where  pn  =  {(— a)n /bn+2}{(n  +  1  )aB  —  nbA };  and  find  the  corresponding  ex¬ 
pansion,  in  descending  powers  of  2,  which  holds  when  |a2|  >  |6|. 

23.  Verify  the  result  of  Ex.  19  in  the  case  of  the  fraction  1/(1  +  z2).  [We 
have  1/(1  +  22)  =  2n  sin{4(n  +  l)7r}  =  1  —  z2  +  z4  — 

24.  Prove  that  if  1 2|  <  1  then 

1  2  .°°. 

- - o  =  2n  sin{|(n  +  l)vr}. 

1  +  2  +  22  V3  0 

25.  Expand  (1  +  2)/(l  +  22),  (1  +  z2)/(  1  +  23)  and  (1  +  2  +  22)/(l  +  24)  in 
ascending  powers  of  2.  For  what  values  of  2  do  your  results  hold? 

26.  If  a/ (a  +  bz  +  cz 2)  =  1  +  p\z  +  P2Z2  +  . . .  then 

2  22  a  +  cz  a2 

l+Plz  +  p2z  +...  =  — -a2_^2_2ac)z  +  c2z2. 

(Math.  Trip.  1900.) 

27.  If  lim  sn  =  l  then 


..  si  +  S2  +  •  •  •  +  sn 

inn  -  =  1. 

n— >00  Ti 

[Let  sn  =  l  +  tn.  Then  we  have  to  prove  that  (ii  +  £2  +  •  •  •  +  tn)/n  tends  to 
zero  if  tn  does  so. 

We  divide  the  numbers  + ,  t2,  ■  ■  ■ ,  tn  into  two  sets  t\ ,  t2.;  . . . ,  tp  and  tp+ 1, 
tp+ 2,  . . . ,  tn.  Here  we  suppose  that  p  is  a  function  of  n  which  tends  to  00  as 
n  — >•  00,  but  more  slowly  than  n,  so  that  p  — >  00  and  p/n  — »•  0:  e.g.  we  might 
suppose  p  to  be  the  integral  part  of  \fn. 
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Let  e  be  any  positive  number.  However  small  e  may  be,  we  can  choose  no  so 
that  tp- |_i,  tp+ 2,  . . . ,  tn  are  all  numerically  less  than  when  n  ^  no,  and  so 

I  {tp+i  +  tp+2  H - Vtn)/n\  <  \e{n-  p)/n  <  \e. 

But,  if  A  is  the  greatest  of  the  moduli  of  all  the  numbers  t\,  t2,  ■  .  . ,  we  have 

\(h  + t2  4 - 1-  tP)/n\  <  pA/n , 

and  this  also  will  be  less  than  when  n  ^  no,  if  no  is  large  enough,  since 
p/n  — >  0  as  n  — >  oo.  Thus 


I (ti  +  t2  +  ■  ■  ■  +  tn)/n\  'A  | (ti  +  t2  +  •  •  •  +  tp)/ n\  +  | (fp+ i  +  •  •  •  +  tn)/^|  <  e 

when  n  ^  no;  which  proves  the  theorem. 

The  reader,  if  he  desires  to  become  expert  in  dealing  with  questions  about 
limits,  should  study  the  argument  above  with  great  care.  It  is  very  often  neces¬ 
sary,  in  proving  the  limit  of  some  given  expression  to  be  zero,  to  split  it  into  two 
parts  which  have  to  be  proved  to  have  the  limit  zero  in  slightly  different  ways. 
When  this  is  the  case  the  proof  is  never  very  easy. 

The  point  of  the  proof  is  this:  we  have  to  prove  that  (t±  +  t2  +  ■  ■  ■  +  tn)/n 
is  small  when  n  is  large,  the  f’s  being  small  when  their  suffixes  are  large.  We 
split  up  the  terms  in  the  bracket  into  two  groups.  The  terms  in  the  first  group 
are  not  all  small,  but  their  number  is  small  compared  with  n.  The  number  in 
the  second  group  is  not  small  compared  with  n,  but  the  terms  are  all  small, 
and  their  number  at  any  rate  less  than  n,  so  that  their  sum  is  small  compared 
with  n.  Hence  each  of  the  parts  into  which  (ti+t2  +  -  ■  ■ +tn)/n  has  been  divided 
is  small  when  n  is  large.] 

28.  If  4>(n)  —  (j)(n  —  1)  — >  l  as  n  — >  oo,  then  (f>{n)/n  — >  l. 

[If  4>(n)  =  si  +  s2  +  ■  ■  ■  +  sn  then  4>(n)  —  </ >(n  —  1)  =  sn,  and  the  theorem 
reduces  to  that  proved  in  the  last  example.] 

29.  If  sn  =  j{1  —  (— l)n},  so  that  sn  is  equal  to  1  or  0  according  as  n  is  odd 
or  even,  then  (si  +  s2  +  •  •  •  +  sn)/n  — >  \  as  n  — >  oo. 

[This  example  proves  that  the  converse  of  Ex.  27  is  not  true:  for  sn  oscillates 
as  n  — >  oo.] 

30.  If  cn,  sn  denote  the  sums  of  the  first  n  terms  of  the  series 

2  +  cos  9  +  cos  29  +  ... ,  sin  9  +  sin  2  9  +  ... , 
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then 

lim(ci  +  c2  H - 1-  cn)/n  =  0,  lim(si  +  s2  H - b  sn)/n  =  \  cot  \8. 


CHAPTER  V 


LIMITS  OF  FUNCTIONS  OF  A  CONTINUOUS  VARIABLE. 

CONTINUOUS  AND  DISCONTINUOUS  FUNCTIONS 

89.  Limits  as  x  tends  to  oo.  We  shall  now  return  to  functions  of  a 
continuous  real  variable.  We  shall  confine  ourselves  entirely  to  one-valued, 
functions,*  and  we  shall  denote  such  a  function  by  4>(x).  We  suppose  x  to 
assume  successively  all  values  corresponding  to  points  on  our  fundamental 
straight  line  A,  starting  from  some  definite  point  on  the  line  and  progressing 
always  to  the  right.  In  these  circumstances  we  say  that  x  tends  to  infinity, 
or  to  oo,  and  write  x  — >  oo.  The  only  difference  between  the  ‘tending 
of  n  to  oo’  discussed  in  the  last  chapter,  and  this  ‘tending  of  £  to  oo’,  is 
that  x  assumes  all  values  as  it  tends  to  oo,  i.e.  that  the  point  P  which 
corresponds  to  x  coincides  in  turn  with  every  point  of  A  to  the  right  of 
its  initial  position,  whereas  n  tended  to  oo  by  a  series  of  jumps.  We  can 
express  this  distinction  by  saying  that  x  tends  continuously  to  oo. 

As  we  explained  at  the  beginning  of  the  last  chapter,  there  is  a  very  close 
correspondence  between  functions  of  x  and  functions  of  n.  Every  function 
of  n  may  be  regarded  as  a  selection  from  the  values  of  a  function  of  x.  In 
the  last  chapter  we  discussed  the  peculiarities  which  may  characterise  the 
behaviour  of  a  function  <p(n)  as  n  tends  to  oo.  Now  we  are  concerned  with 
the  same  problem  for  a  function  <fr(x)]  and  the  definitions  and  theorems 
to  which  we  are  led  are  practically  repetitions  of  those  of  the  last  chapter. 
Thus  corresponding  to  Def.  1  of  §  58  we  have: 

Definition  1.  The  function  <p(x)  is  said  to  tend  to  the  limit  l  as 
x  tends  to  oo  if.  when  any  positive  number  e,  however  small,  is  assigned,  a 
number  x  fie)  can  be  chosen  such  that,  for  all  values  of  x  equal  to  or  greater 
than  xfie),  fifx)  differs  from  l  by  less  than  e,  i.e.  if 

\fi(x)  -l\<e 


when  x  A  xo(e). 

*Thus  fix  stands  in  this  chapter  for  the  one-valued  function  -) -fix  and  not  (as  in 
§  26)  for  the  two- valued  function  whose  values  are  +fix  and  —fix. 
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When  this  is  the  case  we  may  write 


lim  <j>(x)  =  l, 
#—>•00 


or,  when  there  is  no  risk  of  ambiguity,  simply  lim0(x)  =  /,  or  <f>{x)  — >  l. 
Similarly  we  have: 

Definition  2.  The  function  <f>{x)  is  said  to  tend  to  00  with  x  if,  when 
any  number  A,  however  large,  is  assigned,  we  can  choose  a  number  x0(A) 
such  that 

4>{x)  >  A 


when  x  A  xo(A). 
We  then  write 


<f>(x)  — *  00. 


Similarly  we  define  <p(x)  — >  —  00.*  Finally  we  have: 

Definition  3.  If  the  conditions  of  neither  of  the  two  preceding  def¬ 
initions  are  satisfied,  then  (j>(x)  is  said  to  oscillate  as  x  tends  to  00.  If 
\<j>(x)\  is  less  than  some  constant  K  when  x  A  then  <f>(x)  is  said  to 
oscillate  finitely,  and  otherwise  infinitely. 

The  reader  will  remember  that  in  the  last  chapter  we  considered  very 
carefully  various  less  formal  ways  of  expressing  the  facts  represented  by  the 
formulae  (j){n)  — »  l,  <f{n)  — >  00.  Similar  modes  of  expression  may  of  course 
be  used  in  the  present  case.  Thus  we  may  say  that  4>(x)  is  small  or  nearly 
equal  to  l  or  large  when  x  is  large,  using  the  words  ‘small’,  ‘nearly’,  ‘large’ 
in  a  sense  similar  to  that  in  which  they  were  used  in  Ch.  IV. 


*We  shall  sometimes  find  it  convenient  to  write  +00,  x  — >  +00,  <p(x)  — >  +00  instead 
of  00,  x  — »  00,  4>{x)  — >  00. 

Un  the  corresponding  definition  of  §62,  we  postulated  that  \4>{n)\  <  K  for  all 
values  of  n,  and  not  merely  when  n  ^  nj.  But  then  the  two  hypotheses  would  have 
been  equivalent;  for  if  \4>(n)\  <  I\  when  n  A  no,  then  \<j>(n)\  <  K'  for  all  values  of  n, 
where  K'  is  the  greatest  of  |b(l)|,  |b(2)|,  ■  ■  ■ ,  |b(no  —  1) |  and  K.  Here  the  matter  is  not 
quite  so  simple,  as  there  are  infinitely  many  values  of  x  less  than  Xq. 
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Examples  XXXIV.  1.  Consider  the  behaviour  of  the  following  func¬ 
tions  as  x  — >  oo:  1/x,  1  +  (1/x),  x 2,  xk ,  [x],  x  —  [x],  [x]  +  \J x  —  [x] . 

The  first  four  functions  correspond  exactly  to  functions  of  n  fully  discussed 
in  Ch.  IV.  The  graphs  of  the  last  three  were  constructed  in  Ch.  II  (Exs.  XVI. 
1,  2,  4),  and  the  reader  will  see  at  once  that  [x]  — >  oo,  x  —  [x]  oscillates  finitely, 
and  [x]  +  \J x  —  [x]  — >  oo. 

One  simple  remark  may  be  inserted  here.  The  function  <f>(x)  =  x  —  [x] 
oscillates  between  0  and  1,  as  is  obvious  from  the  form  of  its  graph.  It  is  equal 
to  zero  whenever  x  is  an  integer,  so  that  the  function  <j>{n)  derived  from  it  is 
always  zero  and  so  tends  to  the  limit  zero.  The  same  is  true  if 

4>{x)  =  sinx7r,  <j>{n)  =  sinri7r  =  0. 

It  is  evident  that  4>(x)  — >  l  or  4>(x)  — >•  oo  or  </>(x)  — >  — oo  involves  the  corre¬ 
sponding  property  for  but  that  the  converse  is  by  no  means  always  true. 

2.  Consider  in  the  same  way  the  functions: 

(sinx7r)/x,  xsinx7r,  (xsinx7r)2,  tanx7r,  a  cos2  X7r  +  6  sin2  xir, 

illustrating  your  remarks  by  means  of  the  graphs  of  the  functions. 

3.  Give  a  geometrical  explanation  of  Def.  1,  analogous  to  the  geometrical 
explanation  of  Ch.  IV,  §  59. 

4.  If  </(x)  — >•  l,  and  l  is  not  zero,  then  4>(x)  cosx7 r  and  <j>(x)  sinx7r  oscillate 
finitely.  If  <f>(x)  — >  oo  or  4>(x)  — >  —  oo,  then  they  oscillate  infinitely.  The  graph 
of  either  function  is  a  wavy  curve  oscillating  between  the  curves  y  =  <j>{x)  and 

V  =  ~<t>(x)- 

5.  Discuss  the  behaviour,  as  x  — >  oo,  of  the  function 

y  =  /(x)  cos2  X7r  +  F(x)  sin2  X7r, 

where  /(x)  and  F(x)  are  some  pair  of  simple  functions  ( e.g .  x  and  x2).  [The 
graph  of  y  is  a  curve  oscillating  between  the  curves  y  =  /(x),  y  =  F(x).] 

90.  Limits  as  x  tends  to  —  oo.  The  reader  will  have  no  difficulty 
in  framing  for  himself  definitions  of  the  meaning  of  the  assertions  lx  tends 
to  — oo’,  or  ‘x  — >  — oo’  and 

lim  0(x)  =  l,  4>(x)  — >  oo,  (j){x)  — >  — oo. 

x — y — oo 
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In  fact,  if  x  —  —y  and  (j>{x)  =  <j>(—y)  =  if{y),  then  y  tends  to  oo  as  x  tends 
to  —  oo,  and  the  question  of  the  behaviour  of  0(x)  as  x  tends  to  —  oo  is  the 
same  as  that  of  the  behaviour  of  if(y)  as  y  tends  to  oo. 

91.  Theorems  corresponding  to  those  of  Ch.  IV,  §§  63—67.  The 
theorems  concerning  the  sums,  products,  and  quotients  of  functions  proved  in 
Ch.  IV  are  all  true  (with  obvious  verbal  alterations  which  the  reader  will  have 
no  difficulty  in  supplying)  for  functions  of  the  continuous  variable  x.  Not  only 
the  enunciations  but  the  proofs  remain  substantially  the  same. 

92.  Steadily  increasing  or  decreasing  functions.  The  definition 
which  corresponds  to  that  of  §  69  is  as  follows:  the  function  cj>(x)  will  be  said  to 
increase  steadily  with  x  if  cj)(x 2)  ^  0(aq)  whenever  X2  >  x\.  In  many  cases,  of 
course,  this  condition  is  only  satisfied  from  a  definite  value  of  x  onwards,  i.e. 
when  X2  >  x\  ^  xq-  The  theorem  which  follows  in  that  section  requires  no 
alteration  but  that  of  n  into  x:  and  the  proof  is  the  same,  except  for  obvious 
verbal  changes. 

If  <f>{x 2)  >  <f(xi),  the  possibility  of  equality  being  excluded,  whenever 
X2  >  x\ ,  then  (j>{x)  will  be  said  to  be  steadily  increasing  in  the  stricter  sense. 
We  shall  find  that  the  distinction  is  often  important  (cf.  §§  108-109). 

The  reader  should  consider  whether  or  no  the  following  functions  increase 
steadily  with  x  (or  at  any  rate  increase  steadily  from  a  certain  value  of  x  on¬ 
wards):  x2  —  x,  x  +  sin  a;,  x  +  2  sinx,  x2  +  2sinx,  [x],  [x]+sinx,  [x]  +  \J x  —  [x] . 
All  these  functions  tend  to  00  as  x  — >  00. 

93.  Limits  as  x  tends  to  0.  Let  <f>(x)  be  such  a  function  of  x  that 

lim  4>(x)  =  l,  and  let  y  —  1/x.  Then 
£—>■00 

cj)(x)  =  0(1  /y)  = 

say.  As  x  tends  to  00,  y  tends  to  the  limit  0,  and  if(y)  tends  to  the  limit  l. 

Let  us  now  dismiss  x  and  consider  ifiv)  simply  as  a  function  of  y.  We 
are  for  the  moment  concerned  only  with  those  values  of  y  which  correspond 
to  large  positive  values  of  x,  that  is  to  say  with  small  positive  values  of  y. 
And  if(y)  has  the  property  that  by  making  y  sufficiently  small  we  can 
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make  if{y)  differ  by  as  little  as  we  please  from  /.  To  put  the  matter  more 
precisely,  the  statement  expressed  by  lim0(x)  =  l  means  that,  when  any 
positive  number  e,  however  small,  is  assigned,  we  can  choose  xo  so  that 
| <f>(x)  —l |  <  e  for  all  values  of  x  greater  than  or  equal  to  xq.  But  this  is  the 
same  thing  as  saying  that  we  can  choose  y0  =  l/x0  so  that  \ip(y)  —  l\  <  e 
for  all  positive  values  of  y  less  than  or  equal  to  y0. 

We  are  thus  led  to  the  following  definitions: 

A.  If,  when  any  positive  number  e,  however  small,  is  assigned,  we  can 
choose  Uo(e)  so  that 

\<i>(y)  ~l\<e 

when  0  <  y  A  y0(e) ,  then  we  say  that  4>(y)  tends  to  the  limit  l  as  y  tends 
to  0  by  positive  values,  and  we  write 


lim  <j>(y)  =  I 

y- >+o 


B.  If,  when  any  number  A,  however  large,  is  assigned,  we  can  choose 
y0 (A)  so  that 

<f>(y)  >  A 

when  0  <  y  A  yo(A),  then  we  say  that  cp(y)  tends  to  oo  as  y  tends  to  0  by 
positive  values,  and  we  write 

4>{y)  ->  oo- 


We  define  in  a  similar  way  the  meaning  of  ‘0(y)  tends  to  the  limit  l 
as  y  tends  to  0  by  negative  values’,  or  dim  (j>(y)  =  l  when  y  -A  —O’.  We 
have  in  fact  only  to  alter  0  <  y  A  yo(e)  to  —yo{e)  A  y  <  0  in  definition  A. 
There  is  of  course  a  corresponding  analogue  of  definition  B,  and  similar 
definitions  in  which 

4>{y)  -t  -oo 

as  y  — »  +0  or  y  — >  —0. 

If  lim  <f>(y)  =  l  and  lim  <f>(y)  =  /,  we  write  simply 

y— >+ 0  y — y — 0 
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This  case  is  so  important  that  it  is  worth  while  to  give  a  formal  definition. 

If,  when  any  positive  number  e,  however  small,  is  assigned,  we  can 
choose  2/o(e)  so  that,  for  all  values  of  y  different  from  zero  but  numerically 
less  than  or  equal  to  y^e),  <f>(y)  differs  from  l  by  less  than  e,  then  we  say 
that  (f>(y )  tends  to  the  limit  l  as  y  tends  to  0,  and  write 

lim  (f>{y)  =  l . 
y->  o 


So  also,  if  4>(y)  — »  oo  as  y  — »  +0  and  also  as  y  — *  —0,  we  say  that 
<f>{y)  — >  oo  as  y  — »  0.  We  define  in  a  similar  manner  the  statement  that 
<f>(y)  -A  — oo  as  y  -A  0. 

Finally,  if  <j>(y)  does  not  tend  to  a  limit,  or  to  oo,  or  to  — oo,  as  y  — »  +0, 
we  say  that  (f>(y)  oscillates  as  y  — *  +0,  finitely  or  infinitely  as  the  case  may 
be;  and  we  define  oscillation  as  y  — >  —0  in  a  similar  manner. 

The  preceding  definitions  have  been  stated  in  terms  of  a  variable  de¬ 
noted  by  y:  what  letter  is  used  is  of  course  immaterial,  and  we  may  suppose 
x  written  instead  of  y  throughout  them. 

94.  Limits  as  x  tends  to  a.  Suppose  that  <f>{y)  — »  l  as  y  — >  0,  and 
write 

y  =  x  —  a,  4>(y)  =  <f>(x  —  a)  =  if(x). 

If  y  — >  0  then  x  -A  a  and  if(x)  — >  /,  and  we  are  naturally  led  to  write 

lim  if(x)  =  /, 

x^a 

or  simply  liim/;(x)  =  l  or  if(x)  -A  /,  and  to  say  that  if(x)  tends  to  the 
limit  l  as  x  tends  to  a.  The  meaning  of  this  equation  may  be  formally  and 
directly  defined  as  follows:  if,  given  e,  we  can  always  determine  5(e)  so 
that 

1 4>{x)  —  l  |  <  e 

when  0  <  \x  —  a\  ^  5(e),  then 


lim  (j>(x)  =  l. 

x^a 
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By  restricting  ourselves  to  values  of  x  greater  than  a,  i.e.  by  replacing 
0  <  \x  —  a\  ^  5(e)  by  a  <  x  ^  a  +  5(e),  we  define  l(p(x)  tends  to  l  when 
x  approaches  a  from  the  right’,  which  we  may  write  as 

lim  (fix)  =  l. 

x— m+0 

In  the  same  way  we  can  define  the  meaning  of 

lim  (fix )  =  /. 

cc— >-a— 0 

Thus  lim  (fix )  =  /  is  equivalent  to  the  two  assertions 

x^a 

lim  (bix)  =  L  lim  c Mx )  =  l. 

£— >a+ 0  x^-a—0 

We  can  give  similar  definitions  referring  to  the  cases  in  which  (fix)  — >  oo 
or  (ft(x )  — *  — oo  as  x  — >  a  through  values  greater  or  less  than  a;  but  it  is 
probably  unnecessary  to  dwell  further  on  these  definitions,  since  they  are 
exactly  similar  to  those  stated  above  in  the  special  case  when  a  =  0,  and 
we  can  always  discuss  the  behaviour  of  cp(x )  as  x  — >•  a  by  putting  x  —  a  =  y 
and  supposing  that  y  — >  0. 

95.  Steadily  increasing  or  decreasing  functions.  If  there  is  a 
number  6  such  that  4>{xr)  ^  <f>( x ")  whenever  a  —  5  <  x'  <  x"  <  a  +  5,  then 
4>(x)  will  be  said  to  increase  steadily  in  the  neighbourhood  of  x  =  a. 

Suppose  first  that  x  <  a,  and  put  y  =  l/(a  —  x).  Then  y— >-ooasx— >a  —  0, 
and  4>(x)  =  ip(y)  is  a  steadily  increasing  function  of  y,  never  greater  than  4>(a). 
It  follows  from  §  92  that  (fix)  tends  to  a  limit  not  greater  than  (f(a).  We  shall 
write 

lim  <j)(x)  =  (f(a  +  0). 

x— m+0 

We  can  define  (f(a  —  0)  in  a  similar  manner;  and  it  is  clear  that 

4>{a  —  0)  ^  4>(a)  ^  <f(a  +  0). 

It  is  obvious  that  similar  considerations  may  be  applied  to  decreasing  functions. 

If  (fix')  <  (fix"),  the  possibility  of  equality  being  excluded,  whenever 
a  —  6  <  x'  <  x"  <  a  +  5,  then  (fix)  will  be  said  to  be  steadily  increasing  in  the 
stricter  sense. 
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96.  Limits  of  indetermination  and  the  principle  of  conver¬ 
gence.  All  of  the  argument  of  §§  80-84  may  be  applied  to  functions  of  a  con¬ 
tinuous  variable  x  which  tends  to  a  limit  a.  In  particular,  if  4>(x)  is  bounded  in 
an  interval  including  a  (i.e.  if  we  can  find  5,  H,  and  K  so  that  H  <  4>(x)  <  K 
when  a  —  6  ^  x  ^  a  T  5).*  then  we  can  define  A  and  A,  the  lower  and  up¬ 
per  limits  of  indetermination  of  f(x)  as  x  — >  a,  and  prove  that  the  necessary 
and  sufficient  condition  that  (j>(x)  — >  l  as  x  — >  a  is  that  A  =  A  =  l.  We  can 
also  establish  the  analogue  of  the  principle  of  convergence,  i.e.  prove  that  the 
necessary  and  sufficient  condition  that  cj>{x)  should  tend  to  a  limit  as  x  — >•  a 
is  that,  when  e  is  given,  we  can  choose  5(e)  so  that  \f>(x-2)  —  </>(.'Ci)|  <  e  when 
0  <  \x2  —  a|  <  \xi  —  a\  ^  5(e). 

Examples  XXXV.  1.  If 

4>(x)  — >  l ,  if(x)  — >  lf, 

as  x  — >  a,  then  (j>(x)  +  x )  — >  l  T  l',  cj)(x)if(x)  — >  W ,  and  <f>(x)/ip(x)  — >  l/l', 

unless  in  the  last  case  V  =  0. 

[We  saw  in  §  91  that  the  theorems  of  Ch.  IV,  §§  63  et  seq.  hold  also  for 
functions  of  x  when  x  — >  oo  or  x  — >  —  oo.  By  putting  x  =  1/y  we  may  extend 
them  to  functions  of  y,  when  y  — >  0,  and  by  putting  y  =  z  —  a  to  functions  of  z, 
when  z  — ^  a. 

The  reader  should  however  try  to  prove  them  directly  from  the  formal  def¬ 
inition  given  above.  Thus,  in  order  to  obtain  a  strict  direct  proof  of  the  first 
result  he  need  only  take  the  proof  of  Theorem  I  of  §  63  and  write  throughout 
x  for  to,  a  for  oo  and  0  <  \x  —  a|  5=  5  for  to  ^  too-] 

2.  If  m  is  a  positive  integer  then  xm  — >  0  as  x  — >  0. 

3.  If  m  is  a  negative  integer  then  xm  — >  +oo  as  x  — >  +0,  while  xm  — >  —  oo 
or  xm  — >  Too  as  x  — >  —0,  according  as  m  is  odd  or  even.  If  m  =  0  then  xm  =  1 
and  xm  — >  1. 

4.  lim  (a  +  hx  +  cx2  T  •  •  •  T  kxm)  =  a. 

x^O 

5.  lim  {(a  T  bx  +  ■  ■  ■  +  kxm)/(a  +  fix  T  •  •  •  T  nx^1)}  =  a/a,  unless  a  =  0. 

x — 

If  a  =  0  and  a  /  0,  ^  /  0,  then  the  function  tends  to  Too  or  — oo,  as  x  — >  TO, 

*For  some  further  discussion  of  the  notion  of  a  function  bounded  in  an  interval  see 
§102. 
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according  as  a  and  j3  have  like  or  unlike  signs;  the  case  is  reversed  if  x  —>  —0. 
The  case  in  which  both  a  and  a  vanish  is  considered  in  Ex.  xxxvi.  5.  Discuss 
the  cases  which  arise  when  a  /  0  and  more  than  one  of  the  first  coefficients  in 
the  denominator  vanish. 

6.  lim  xm  =  am,  if  m  is  any  positive  or  negative  integer,  except  when  a  =  0 

x^a 

and  m  is  negative.  [If  m  >  0,  put  x  =  y  +  a  and  apply  Ex.  4.  When  m  <  0, 
the  result  follows  from  Ex.  1  above.  It  follows  at  once  that  limi->(a:)  =  P(a),  if 
P(x)  is  any  polynomial.] 

7.  lim  R(x)  =  R(a),  if  R  denotes  any  rational  function  and  a  is  not  one  of 

x  —¥a 

the  roots  of  its  denominator. 

8.  Show  that  lim  xm  =  am  for  all  rational  values  of  m,  except  when  a  =  0 

x^ra 

and  m  is  negative.  [This  follows  at  once,  when  a  is  positive,  from  the  inequalities 
(9)  or  (10)  of  §  74.  For  \xm  —  am\  <  H\x  —  a\,  where  H  is  the  greater  of  the 
absolute  values  of  mxm~1  and  mara_1  (cf.  Ex.  XXVIII.  4).  If  a  is  negative  we 
write  x  =  —  y  and  a  =  —b.  Then 

lim  xm  =  lim(— 1  )mym  =  (-1  )mbm  =  am .] 


97.  The  reader  will  probably  fail  to  see  at  first  that  any  proof  of  such 
results  as  those  of  Exs.  4,  5,  6,  7,  8  above  is  necessary.  He  may  ask  ‘why 
not  simply  put  x  =  0,  or  x  =  a?  Of  course  we  then  get  a,  a/a,  am, 
P(a),  R(a)\  It  is  very  important  that  he  should  see  exactly  where  he  is 
wrong.  We  shall  therefore  consider  this  point  carefully  before  passing  on 
to  any  further  examples. 

The  statement 

lim  Mx)  =  l 

rr->-0 

is  a  statement  about  the  values  of  (f>(x)  when  x  has  any  value  distinct  from 
but  differing  by  little  from  zero.*  It  is  not  a  statement  about  the  value 
of  4>(x)  when  x  —  0.  When  we  make  the  statement  we  assert  that,  when 
x  is  nearly  equal  to  zero,  <f(x)  is  nearly  equal  to  l.  We  assert  nothing 
whatever  about  what  happens  when  x  is  actually  equal  to  0.  So  far  as  we 


*Thus  in  Def.  A  of  §  93  we  make  a  statement  about  values  of  y  such  that  0  <  y  5s  y0, 
the  first  of  these  inequalities  being  inserted  expressly  in  order  to  exclude  the  value  y  =  0. 
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know,  0(0  may  not  be  defined  at  all  for  x  =  0;  or  it  may  have  some  value 
other  than  l.  For  example,  consider  the  function  defined  for  all  values  of  x 
by  the  equation  0(0  =  0.  It  is  obvious  that 

lim0(0  =  0.  (1) 

Now  consider  the  function  0(0  which  differs  from  0(0  only  in  that 
0(0  =  1  when  x  —  0.  Then 


lim0(0  =  O,  (2) 

for,  when  x  is  nearly  equal  to  zero,  0(0  is  not  only  nearly  but  exactly 
equal  to  zero.  But  "0(0)  =  1.  The  graph  of  this  function  consists  of  the 
axis  of  x ,  with  the  point  x  =  0  left  out,  and  one  isolated  point,  viz.  the 
point  (0,1).  The  equation  (2)  expresses  the  fact  that  if  we  move  along 
the  graph  towards  the  axis  of  y,  from  either  side,  then  the  ordinate  of  the 
curve,  being  always  equal  to  zero,  tends  to  the  limit  zero.  This  fact  is  in 
no  way  affected  by  the  position  of  the  isolated  point  (0, 1). 

The  reader  may  object  to  this  example  on  the  score  of  artificiality: 
but  it  is  easy  to  write  down  simple  formulae  representing  functions  which 
behave  precisely  like  this  near  x  —  0.  One  is 

0(0)  =  [1-0], 

where  [1  —  x2]  denotes  as  usual  the  greatest  integer  not  greater  than  1  —  x2. 
For  if  x  =  0  then  0(0  =  [1]  =  1;  while  if  0  <  x  <  1,  or  —1  <  x  <  0,  then 
0  <  1  —  x2  <  1  and  so  0(0  =  [1  —  x2}  =  0. 

Or  again,  let  us  consider  the  function 

y  =  x/x 

already  discussed  in  Ch.  II,  §24,  (2).  This  function  is  equal  to  1  for  all 
values  of  x  save  x  =  0.  It  is  not  equal  to  1  when  x  =  0:  it  is  in  fact 
not  defined  at  all  for  x  =  0.  For  when  we  say  that  0(0  is  defined  for 
x  =  0  we  mean  (as  we  explained  in  Ch.  II,  l.c .)  that  we  can  calculate  its 
value  for  x  =  0  by  putting  x  —  0  in  the  actual  expression  of  0(0.  In 
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this  case  we  cannot.  When  we  put  x  —  0  in  (j>(x)  we  obtain  0/0,  which  is 
a  meaningless  expression.  The  reader  may  object  ‘divide  numerator  and 
denominator  by  x\  But  he  must  admit  that  when  x  =  0  this  is  impossible. 
Thus  y  =  x/x  is  a  function  which  differs  from  y  =  1  solely  in  that  it  is  not 
defined  for  x  —  0.  None  the  less 

lim(a:/a:)  =  1, 

for  x/x  is  equal  to  1  so  long  as  x  differs  from  zero,  however  small  the 
difference  may  be. 

Similarly  <f>(x)  =  {(x  +  l)2  —  l}/x  =  x  +  2  so  long  as  x  is  not  equal  to 
zero,  but  is  undefined  when  x  =  0.  None  the  less  lim0(:c)  =  2. 

On  the  other  hand  there  is  of  course  nothing  to  prevent  the  limit  of  4>(x) 
as  x  tends  to  zero  from  being  equal  to  0(0),  the  value  of  (j>(x)  for  x  —  0. 
Thus  if  (j)(x)  =  x  then  0(0)  =  0  and  lim0(x)  =  0.  This  is  in  fact,  from  a 
practical  point  of  view,  i.  e.  from  the  point  of  view  of  what  most  frequently 
occurs  in  applications,  the  ordinary  case. 

Examples  XXXVI.  1.  lim(.r2  —  o2)/(x  —  a)  =  2a. 

x^a 

2.  lim(xm  —  am)/(x  —  a)  =  mam~ 1,  if  m  is  any  integer  (zero  included). 

X—Hl 

3.  Show  that  the  result  of  Ex.  2  remains  true  for  all  rational  values  of  m, 
provided  a  is  positive.  [This  follows  at  once  from  the  inequalities  (9)  and  (10) 
of  §  74.] 

4.  lim(ar  —  2x 5  +  l)/(x3 4 5  —  3x2  +  2)  =  1.  [Observe  that  x  —  1  is  a  factor  of 

x— s>l 

both  numerator  and  denominator.] 

5.  Discuss  the  behaviour  of 

0(x)  =  (a0xm  +  alXm+1  +  •  •  •  +  akxm+k)/(b0xn  +  blXn+1  +  •  •  •  +  blXn+l) 

as  x  tends  to  0  by  positive  or  negative  values. 

[If  m  >  n,  lim0(x)  =  0.  If  m  =  n,  lim0(x)  =  a^/bo.  If  m  <  n  and  n  —  m 
is  even,  0(x)  -»  +oo  or  4>(x)  -A  — oo  according  as  a^/bo  >  0  or  ao/bo  <  0.  If 
m  <  n  and  n  —  m  is  odd,  0(x)  — >  +oo  as  x  — >  +0  and  0(x)  -A  — oo  as  x  A  —0, 
or  0(x)  — >  — oo  as  x  -A  +0  and  0(x)  — +oo  as  x  — >  —0,  according  as  ao/bo  >  0 
or  ao/bo  <  0.] 
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6.  Orders  of  smallness.  When  x  is  small  x2  is  very  much  smaller,  x3  much 
smaller  still,  and  so  on:  in  other  words 

lim(x2/x)  =  0,  lim(x3/x2)  =  0,  .... 

x—>0K 

Another  way  of  stating  the  matter  is  to  say  that,  when  x  tends  to  0,  x2, 
x3,  . . .  all  also  tend  to  0,  but  x2  tends  to  0  more  rapidly  than  x ,  x3  than  x2, 
and  so  on.  It  is  convenient  to  have  some  scale  by  which  to  measure  the  rapidity 
with  which  a  function,  whose  limit,  as  x  tends  to  0,  is  0,  diminishes  with  x,  and 
it  is  natural  to  take  the  simple  functions  x,  x 2,  x3,  ...  as  the  measures  of  our 
scale. 

We  say,  therefore,  that  fi(x)  is  of  the  first  order  of  smallness  if  f>[x)/x  tends 
to  a  limit  other  than  0  as  x  tends  to  0.  Thus  2x  +  3x2  +  x7  is  of  the  first  order 
of  smallness,  since  lim(2x'  +  3x2  +  x7)/x  =  2. 

Similarly  we  define  the  second,  third,  fourth,  . . .  orders  of  smallness.  It  must 
not  be  imagined  that  this  scale  of  orders  of  smallness  is  in  any  way  complete.  If 
it  were  complete,  then  every  function  (f{x)  which  tends  to  zero  with  x  would  be 
of  either  the  first  or  second  or  some  higher  order  of  smallness.  This  is  obviously 
not  the  case.  For  example  4>(x)  =  x 7 ^  tends  to  zero  more  rapidly  than  x  and 
less  rapidly  than  x2. 

The  reader  may  not  unnaturally  think  that  our  scale  might  be  made  complete 

by  including  in  it  fractional  orders  of  smallness.  Thus  we  might  say  that  x7//-5  was 

of  the  ?th  order  of  smallness.  We  shall  however  see  later  on  that  such  a  scale  of 
5 

orders  would  still  be  altogether  incomplete.  And  as  a  matter  of  fact  the  integral 
orders  of  smallness  defined  above  are  so  much  more  important  in  applications 
than  any  others  that  it  is  hardly  necessary  to  attempt  to  make  our  definitions 
more  precise. 

Orders  of  greatness.  Similar  definitions  are  at  once  suggested  to  meet  the 
case  in  which  4>(x)  is  large  (positively  or  negatively)  when  x  is  small.  We  shall  say 
that  4>(x)  is  of  the  fcth  order  of  greatness  when  x  is  small  if  fi(x)/x~k  =  xkf>(x) 
tends  to  a  limit  different  from  0  as  x  tends  to  0. 

These  definitions  have  reference  to  the  case  in  which  x  — >•  0.  There  are  of 
course  corresponding  definitions  relating  to  the  cases  in  which  x  — >  oo  or  x  — >  a. 
Thus  if  xk<f(x)  tends  to  a  limit  other  than  zero,  as  x  oo,  then  we  say  that 
4>{x)  is  of  the  fcth  order  of  smallness  when  x  is  large:  while  if  (x  —  a)k(f(x )  tends 
to  a  limit  other  than  zero,  as  x  — >  a,  then  we  say  that  4>(x)  is  of  the  kth  order 
of  greatness  when  x  is  nearly  equal  to  a. 
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7*  lim  \/l  +  x  =  lim  y/1  —  x  =  1.  [Put  l  +  x  =  yorl  —  x  =  y,  and  use 

Ex.  xxxv.  8.] 

8.  lim{  y/1  +  x  —  \/l  —  x}/x  =  1.  [Multiply  numerator  and  denominator  by 
yjl  +  x  +  \/l  —  x.] 

9.  Consider  the  behaviour  of  {y/l  +  xm  —  y/l  —  xm}/xn  as  x  — >  0,  m  and  n 
being  positive  integers. 

10.  lim{ \/l  +  x  +  x2  —  l}/x  =  t, . 

y/l  +  x-  \/l+x2 

11.  lim  . _ _ _ ,  =  1. 

Vl  —  x2  —  \Jl  —  x 

12.  Draw  a  graph  of  the  function 


V  = 


1111 

H - r  H - t  H - 


x  —  1 


x~\  x~\ 


x  —  1 


+ 


x-\  x-\ 

1  1  1 


+ 


x- i 


Has  it  a  limit  as  x  — >  0?  [Here  y  =  1  except  for  x  =  1,  3,  when  y  is 

not  defined,  and  y  — >  1  as  x  — >  0.] 

13.  lim  —  =  1. 

x 

[It  may  be  deduced  from  the  definitions  of  the  trigonometrical  ratios^  that 
if  x  is  positive  and  less  than  ~tt  then 


sin  x  <  x  <  tan  x 


or 


sinx 

cos  x  <  -  <  1 

X 


or 


0  <  1  - 


sin  x 


n  -1 

<  1  —  cosx  =  2  sin"  ^  x 


But  2sin2|x  <  2(^x)2  =  ^x2.  Hence  lim  (  1  —  Sm X 
2  2  2  s-s>+0  \  x 

sm  x  sm  x 

lim  - =  1.  As  -  is  an  even  function,  the  result  follows.] 

£->+0  x  x 


0,  and 


*In  the  examples  which  follow  it  is  to  be  assumed  that  limits  as  x  — >  0  are  required, 
unless  (as  in  Exs.  19,  22)  the  contrary  is  explicitly  stated. 

iThe  proofs  of  the  inequalities  which  are  used  here  depend  on  certain  properties  of 
the  area  of  a  sector  of  a  circle  which  are  usually  taken  as  geometrically  intuitive;  for 
example,  that  the  area  of  the  sector  is  greater  than  that  of  the  triangle  inscribed  in  the 
sector.  The  justification  of  these  assumptions  must  be  postponed  to  Ch.  VII. 
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14.  lim 


1  —  cos  x 


x * 
sin.  cxoc 

15.  lim - =  a.  Is  this  true  if  a  =  0? 

x 

arc  sin  x 

16.  lim -  =  1.  [Put  x  =  siny.J 


x 

tan  ax 

17.  lim -  =  a, 


arc  tan  ax 
lim - =  a. 


18.  lim 

19.  lim 

1^1 


x 

cosec  x  —  cot  x 


x 

1  +  COS  7 TX 


1 

2- 


tan2  7 tx  2 

20.  How  do  the  functions  sin(l/x),  (1/x)  sin(l/x),  xsin(l/x)  behave  as 
x  — >  0?  [The  first  oscillates  finitely,  the  second  infinitely,  the  third  tends  to  the 
limit  0.  None  is  defined  when  x  =  0.  See  Exs.  xv.  6,  7,  8.] 

21.  Does  the  function 


1\  / (  .  1 

y  =  I  sin  —  )  /  |  sm  — 


tend  to  a  limit  as  x  tends  to  0?  [No.  The  function  is  equal  to  1  except  when 
sin(l/x)  =  0;  i.e.  when  x  =  1/n,  l/2n,  . . . ,  —  1/ 7r,  — 1/27T,  ....  For  these  values 
the  formula  for  y  assumes  the  meaningless  form  0/0,  and  y  is  therefore  not 
defined  for  an  infinity  of  values  of  x  near  x  =  0.] 

22.  Prove  that  if  m  is  any  integer  then  [x]  — >•  m  and  x  —  [x]  — >  0  as  x  — >  m+ 0, 
and  [x]  — >  m  —  1,  x  —  [x]  — >  1  as  x  — >  m  —  0. 


98.  Continuous  functions  of  a  real  variable.  The  reader  has  no 
doubt  some  idea  as  to  what  is  meant  by  a  continuous  curve.  Thus  he  would 
call  the  curve  C  in  Fig.  29  continuous,  the  curve  C  generally  continuous 
but  discontinuous  for  x  =  £'  and  x  —  £". 

Either  of  these  curves  may  be  regarded  as  the  graph  of  a  function  4>(x). 
It  is  natural  to  call  a  function  continuous  if  its  graph  is  a  continuous  curve, 
and  otherwise  discontinuous.  Let  us  take  this  as  a  provisional  definition 
and  try  to  distinguish  more  precisely  some  of  the  properties  which  are 
involved  in  it. 

In  the  first  place  it  is  evident  that  the  property  of  the  function  y  =  <p(x) 
of  which  C  is  the  graph  may  be  analysed  into  some  property  possessed  by 
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the  curve  at  each  of  its  points.  To  be  able  to  define  continuity  for  all 
values  of  x  we  must  first  define  continuity  for  any  particular  value  of  x. 
Let  us  therefore  fix  on  some  particular  value  of  x,  say  the  value  x  =  £ 
corresponding  to  the  point  P  of  the  graph.  What  are  the  characteristic 
properties  of  (j>{x)  associated  with  this  value  of  x! 

In  the  first  place  (j>{x)  is  defined  for  x  =  f.  This  is  obviously  essential. 
If  </>(£)  were  not  defined  there  would  be  a  point  missing  from  the  curve. 

Secondly  cp(x)  is  defined  for  all  values  of  x  near  x  =  £;  i.e.  we  can  find 
an  interval,  including  x  —  f  in  its  interior,  for  all  points  of  which  (j){x)  is 
defined. 

Thirdly  if  x  approaches  the  value  f  from  either  side  then  (f(x)  ap¬ 
proaches  the  limit  0(£). 

The  properties  thus  defined  are  far  from  exhausting  those  which  are 
possessed  by  the  curve  as  pictured  by  the  eye  of  common  sense.  This  pic¬ 
ture  of  a  curve  is  a  generalisation  from  particular  curves  such  as  straight 
lines  and  circles.  But  they  are  the  simplest  and  most  fundamental  proper¬ 
ties:  and  the  graph  of  any  function  which  has  these  properties  would,  so 
far  as  drawing  it  is  practically  possible,  satisfy  our  geometrical  feeling  of 
what  a  continuous  curve  should  be.  We  therefore  select  these  properties  as 
embodying  the  mathematical  notion  of  continuity.  We  are  thus  led  to  the 
following 

Definition.  The  function  f>(x)  is  said  to  be  continuous  for  x  =  f  if  it 
tends  to  a  limit  as  x  tends  to  f  from  either  side,  and  each  of  these  limits 
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is  equal  to  <j>(£). 

We  can  now  define  continuity  throughout  an  interval.  The  function  <f>(x) 
is  said  to  be  continuous  throughout  a  certain  interval  of  values  of  x  if  it  is 
continuous  for  all  values  of  x  in  that  interval.  It  is  said  to  be  continuous 
everywhere  if  it  is  continuous  for  every  value  of  x.  Thus  [x]  is  continuous 
in  the  interval  [5, 1  —  5],  where  5  is  any  positive  number  less  than  and  1 
and  x  are  continuous  everywhere. 

If  we  recur  to  the  definitions  of  a  limit  we  see  that  our  definition  is 
equivalent  to  ‘4>(x)  is  continuous  for  x  =  £  if,  given  e,  we  can  choose  5(e) 
so  that  | <f>(x)  —  0(0  |  <  e  if  0  ^  \x  —  £|  ^  5(e)’. 

We  have  often  to  consider  functions  defined  only  in  an  interval  [a,b]. 
In  this  case  it  is  convenient  to  make  a  slight  and  obvious  change  in  our 
definition  of  continuity  in  so  far  as  it  concerns  the  particular  points  a  and  b. 
We  shall  then  say  that  0(x)  is  continuous  for  x  =  a  if  0(a  +  0)  exists  and 
is  equal  to  0(a),  and  for  x  =  b  if  0(5  —  0)  exists  and  is  equal  to  (f>(b). 

99.  The  definition  of  continuity  given  in  the  last  section  may  be  illus¬ 
trated  geometrically  as  follows.  Draw  the  two  horizontal  lines  y  =  0(0  —  e 
and  y  =  0(0  +  e.  Then  \(f>(x)  —  0(01  <  6  expresses  the  fact  that  the  point 
on  the  curve  corresponding  to  x  lies  between  these  two  lines.  Similarly 


x  —  0  ^5  expresses  the  fact  that  x  lies  in  the  interval  [£  —  5,  £  +  5].  Thus 
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our  definition  asserts  that  if  we  draw  two  such  horizontal  lines,  no  matter 
how  close  together,  we  can  always  cut  off  a  vertical  strip  of  the  plane  by 
two  vertical  lines  in  such  a  way  that  all  that  part  of  the  curve  which  is  con¬ 
tained  in  the  strip  lies  between  the  two  horizontal  lines.  This  is  evidently 
true  of  the  curve  C  (Fig.  29),  whatever  value  £  may  have. 

We  shall  now  discuss  the  continuity  of  some  special  types  of  functions. 
Some  of  the  results  which  follow  were  (as  we  pointed  out  at  the  time) 
tacitly  assumed  in  Ch.  II. 

Examples  XXXVII.  1.  The  sum  or  product  of  two  functions  continu¬ 
ous  at  a  point  is  continuous  at  that  point.  The  quotient  is  also  continuous  unless 
the  denominator  vanishes  at  the  point.  [This  follows  at  once  from  Ex.  xxxv.  1.] 

2.  Any  polynomial  is  continuous  for  all  values  of  x.  Any  rational  fraction 
is  continuous  except  for  values  of  x  for  which  the  denominator  vanishes.  [This 
follows  from  Exs.  xxxv.  6,  7.] 

3.  y/x  is  continuous  for  all  positive  values  of  x  (Ex.  xxxv.  8).  It  is  not 
defined  when  x  <  0,  but  is  continuous  for  x  =  0  in  virtue  of  the  remark  made 
at  the  end  of  §  98.  The  same  is  true  of  xm/n,  where  m  and  n  are  any  positive 
integers  of  which  n  is  even. 

4.  The  function  xm/n,  where  n  is  odd,  is  continuous  for  all  values  of  x. 

5.  \/x  is  not  continuous  for  x  =  0.  It  has  no  value  for  x  =  0,  nor  does  it 
tend  to  a  limit  as  x  — >  0.  In  fact  1/x  — >•  +oo  or  \/x  — >  — oo  according  as  x  — >  0 
by  positive  or  negative  values. 

6.  Discuss  the  continuity  of  x~m^n,  where  m  and  n  are  positive  integers, 
for  x  =  0. 

7.  The  standard  rational  function  R(x)  =  P(x)/Q(x)  is  discontinuous  for 
x  =  a,  where  a  is  any  root  of  Q(x)  =  0.  Thus  ( x 2  +  l)/(x2  —  3x  +  2)  is  dis¬ 
continuous  for  x  =  1.  It  will  be  noticed  that  in  the  case  of  rational  functions  a 
discontinuity  is  always  associated  with  (a)  a  failure  of  the  definition  for  a  partic¬ 
ular  value  of  x  and  ( b )  a  tending  of  the  function  to  +oo  or  — oo  as  x  approaches 
this  value  from  either  side.  Such  a  particular  kind  of  point  of  discontinuity  is 
usually  described  as  an  infinity  of  the  function.  An  ‘infinity1  is  the  kind  of 
discontinuity  of  most  common  occurrence  in  ordinary  work. 

8.  Discuss  the  continuity  of 

y/(x  -  a)(b  -  x),  y/ (x  -  a)(b  -  x),  y/ (x  -  a)/(b  -  x),  y/ (x  -  a)/(b  -  x). 
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9.  sinx  and  cosx  are  continuous  for  all  values  of  x. 

[We  have 

sin  (a:  +  h)  —  sinx  =  2  sin  \h  cos(x  +  ^h), 

which  is  numerically  less  than  the  numerical  value  of  h] 

10.  For  what  values  of  x  are  tanx,  cotx,  secx,  and  cosec  x  continuous  or 
discontinuous? 

11.  If  f(y)  is  continuous  for  y  =  rj,  and  0(x)  is  a  continuous  function  of  x 
which  is  equal  to  y  when  x  =  £,  then  /{0(x)}  is  continuous  for  x  =  £. 

12.  If  0(x)  is  continuous  for  any  particular  value  of  x,  then  any  polynomial 
in  0(x),  such  as  a{0(x)}m  +  . . . ,  is  so  too. 

13.  Discuss  the  continuity  of 

l/(acos2  x  +  6  sin2  x),  y/2  +  cos  x,  yjl  +  sinx,  1  / \/ 1  +  sinx. 

14.  sin(l/x),  xsin(l/x),  and  x2sin(l/x)  are  continuous  except  for  x  =  0. 

15.  The  function  which  is  equal  to  xsin(l/x)  except  when  x  =  0,  and  to 
zero  when  x  =  0,  is  continuous  for  all  values  of  x. 

16.  [x]  and  x  —  [x]  are  discontinuous  for  all  integral  values  of  x. 

17.  For  what  (if  any)  values  of  x  are  the  following  functions  discontinuous: 
[X2],  [y/x],  yjx-  [x],  [x]  +  yjx  -  [x],  [2x] ,  [x]  +  [~x]? 

18.  Classification  of  discontinuities.  Some  of  the  preceding  examples 
suggest  a  classification  of  different  types  of  discontinuity. 

(1)  Suppose  that  0(x)  tends  to  a  limit  as  x  — >  a  either  by  values  less 
than  or  by  values  greater  than  a.  Denote  these  limits,  as  in  §  95,  by  0(a  —  0) 
and  0(a  +  0)  respectively.  Then,  for  continuity,  it  is  necessary  and  sufficient 
that  0(x)  should  be  defined  for  x  =  a,  and  that  < 0(a  —  0)  =  0(a)  =  0(a  +  0). 
Discontinuity  may  arise  in  a  variety  of  ways. 

(a)  0(a  —  0)  may  be  equal  to  0(a  +  0),  but  0(a)  may  not  be  defined,  or 
may  differ  from  0(a  —  0)  and  0(a  +  0).  Thus  if  0(x)  =  xsin(l/x)  and  a  =  0, 
0(0  —  0)  =  0(0  +  0)  =  0,  but  0(x)  is  not  defined  for  x  =  0.  Or  if  0(x)  =  [1  —  x2] 
and  a  =  0,  0(0  —  0)  =  0(0  +  0)  =  0,  but  0(0)  =  1. 

(0)  0(a  —  0)  and  0(a  +  0)  may  be  unequal.  In  this  case  0(a)  may  be 
equal  to  one  or  to  neither,  or  be  undefined.  The  first  case  is  illustrated 
by  0(x)  =  [x],  for  which  0(0  —  0)  =  —1,  0(0  +  0)  =  0(0)  =  0;  the  second  by 
0(x)  =  [x]  —  [— x],  for  which  0(0  —  0)  =  —1,  0(0  +  0)  =  1,  0(0)  =  0;  and  the 
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third  by  0(x)  =  [x]  +  xsin(l/x),  for  which  0(0  —  0)  =  —1,  0(0  +  0)  =  0,  and 
0(0)  is  undefined. 

In  any  of  these  cases  we  say  that  0(x)  has  a  simple  discontinuity  at  x  =  a. 
And  to  these  cases  we  may  add  those  in  which  0(x)  is  defined  only  on  one  side  of 
x  =  a,  and  0(a  — 0)  or  0(a  +  O),  as  the  case  may  be,  exists,  but  0(x)  is  either  not 
defined  when  x  =  a  or  has  when  x  =  a  a  value  different  from  0(a  — 0)  or  0(a  +  O). 

It  is  plain  from  §  95  that  a  function  which  increases  or  decreases  steadily  in 
the  neighbourhood  of  x  =  a  can  have  at  most  a  simple  discontinuity  for  x  =  a. 

(2)  It  may  be  the  case  that  only  one  (or  neither)  of  0(a  — 0)  and  0(a  +  O) 
exists,  but  that,  supposing  for  example  0(a  +  0)  not  to  exist,  0(x)  -»  +oo  or 
0(x)  — >  —  oo  as  x  — >  a  +  0,  so  that  0(x)  tends  to  a  limit  or  to  +oo  or  to  — oo  as 
x  approaches  a  from  either  side.  Such  is  the  case,  for  instance,  if  0(x)  =  1/x  or 
0( x)  =  1/x2,  and  a  =  0.  In  such  cases  we  say  (cf.  Ex.  7)  that  x  =  a  is  an  infinity 
of  0(x).  And  again  we  may  add  to  these  cases  those  in  which  0(x)  -»  +oo  or 
0(x)  — >  — oo  as  x  — a  from  one  side,  but  0(x)  is  not  defined  at  all  on  the  other 
side  of  x  =  a. 

(3)  Any  point  of  discontinuity  which  is  not  a  point  of  simple  discontinuity 
nor  an  infinity  is  called  a  point  of  oscillatory  discontinuity.  Such  is  the  point 
x  =  0  for  the  functions  sin(l/x),  (1/x)  sin(l/x). 

19.  What  is  the  nature  of  the  discontinuities  at  x  =  0  of  the  functions 
(sinx)/x,  [x]  +  [— x],  cosecx,  \/Y[x,  -y/l/x,  cosec(l/x),  sin(l/x)/ sin(l/x)? 

20.  The  function  which  is  equal  to  1  when  x  is  rational  and  to  0  when  x  is 
irrational  (Ch.  II,  Ex.  XVI.  10)  is  discontinuous  for  all  values  of  x.  So  too  is  any 
function  which  is  defined  only  for  rational  or  for  irrational  values  of  x. 

21.  The  function  which  is  equal  to  x  when  x  is  irrational  and  to 
-^/(l  +  p2)/(  1  +  q2)  when  x  is  a  rational  fraction  p/q  (Ch.  II,  Ex.  XVI.  11) 
is  discontinuous  for  all  negative  and  for  positive  rational  values  of  x,  but 
continuous  for  positive  irrational  values. 

22.  For  what  points  are  the  functions  considered  in  Ch.  IV,  Exs.  XXXI  dis¬ 
continuous,  and  what  is  the  nature  of  their  discontinuities?  [Consider,  e.g.,  the 
function  y  =  limxn  (Ex.  5).  Here  y  is  only  defined  when  —  1  <  x  ^  1:  it  is  equal 
to  0  when  —  1  <  x  <  1  and  to  1  when  x  =  1.  The  points  x  =  1  and  x  =  —  1  are 
points  of  simple  discontinuity.] 
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100.  The  fundamental  property  of  a  continuous  function.  It 

may  perhaps  be  thought  that  the  analysis  of  the  idea  of  a  continuous  curve 
given  in  §  98  is  not  the  simplest  or  most  natural  possible.  Another  method 
of  analysing  our  idea  of  continuity  is  the  following.  Let  A  and  B  be  two 
points  on  the  graph  of  00)  whose  coordinates  are  x0,  0Oo)  and  Xi,  0Oi) 
respectively.  Draw  any  straight  line  A  which  passes  between  A  and  B.  Then 
common  sense  certainly  declares  that  if  the  graph  of  00)  is  continuous  it 
must  cut  A. 

If  we  consider  this  property  as  an  intrinsic  geometrical  property  of  con¬ 
tinuous  curves  it  is  clear  that  there  is  no  real  loss  of  generality  in  supposing 
A  to  be  parallel  to  the  axis  of  x.  In  this  case  the  ordinates  of  A  and  B 
cannot  be  equal:  let  us  suppose,  for  definiteness,  that  0Oi)  >  0Oo)-  And 
let  A  be  the  line  y  =  rj,  where  0Oo)  <  rj  <  <f>(xi).  Then  to  say  that  the 
graph  of  00)  must  cut  A  is  the  same  thing  as  to  say  that  there  is  a  value 
of  x  between  x0  and  x\  for  which  00)  =  rj. 

We  conclude  then  that  a  continuous  function  00)  must  possess  the 
following  property:  if 


0Oo)  =  2/o,  000  =  2/i, 

and  yo  <  7]  <  yi,  then  there  is  a  value  of  x  between  xq  and  x\  for  which 
00)  =  V-  In  other  words  as  x  varies  from  x0  to  xi,  y  must  assume  at  least 
once  every  value  between  y0  and  yv . 

We  shall  now  prove  that  if  <f>fx)  is  a  continuous  function  of  x  in  the 
sense  defined  in  §  98  then  it  does  in  fact  possess  this  property.  There  is  a 
certain  range  of  values  of  x,  to  the  right  of  Xo,  for  which  00)  <  V-  For 
0Oo)  <  hi  and  so  00)  is  certainly  less  than  r /  if  00)  —  0Oo)  is  numerically 
less  than  rj  —  0(xo).  But  since  00)  is  continuous  for  x  =  x0,  this  condition 
is  certainly  satisfied  if  x  is  near  enough  to  Xq.  Similarly  there  is  a  certain 
range  of  values,  to  the  left  of  xi,  for  which  f>(x)  >  rj. 

Let  us  divide  the  values  of  x  between  xq  and  xi  into  two  classes  L,  R 
as  follows: 

(1)  in  the  class  L  we  put  all  values  f  of  x  such  that  00)  <  h  when 
x  =  f  and  for  all  values  of  x  between  x0  and  £ ; 

(2)  in  the  class  R  we  put  all  the  other  values  of  x,  i.e.  all  numbers  f 
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such  that  either  0(£)  ^  r)  or  there  is  a  value  of  x  between  x0  and  £  for 
which  0(x)  A  r). 

Then  it  is  evident  that  these  two  classes  satisfy  all  the  conditions  im¬ 
posed  upon  the  classes  L,  R  of  §  17,  and  so  constitute  a  section  of  the  real 
numbers.  Let  £0  be  the  number  corresponding  to  the  section. 

First  suppose  0(£o)  >  V->  so  that  £0  belongs  to  the  upper  class:  and  let 
0(£o)  =  V  +  k,  say.  Then  0(£')  <  r/  and  so 

HZo)  -  HZ')  >  k, 

for  all  values  of  £'  less  than  £o;  which  contradicts  the  condition  of  continuity 
for  x  =  £o- 

Next  suppose  0(£o)  —  V  ~  k  <  rj.  Then,  if  £'  is  any  number  greater 
than  £0;  either  HZ')  =  V  or  we  can  find  a  number  £"  between  £0  and  £' 
such  that  0(£")  ^  T).  In  either  case  we  can  find  a  number  as  near  to  £0  as 
we  please  and  such  that  the  corresponding  values  of  Hx)  differ  by  more 
than  k.  And  this  again  contradicts  the  hypothesis  that  Hx)  is  continuous 
for  x  =  £o- 

Hence  0(£o)  —  Vi  and  the  theorem  is  established.  It  should  be  observed 
that  we  have  proved  more  than  is  asserted  explicitly  in  the  theorem;  we 
have  proved  in  fact  that  £o  is  the  least  value  of  x  for  which  Hx)  =  V-  It 
is  not  obvious,  or  indeed  generally  true,  that  there  is  a  least  among  the 
values  of  x  for  which  a  function  assumes  a  given  value,  though  this  is  true 
for  continuous  functions. 

It  is  easy  to  see  that  the  converse  of  the  theorem  just  proved  is  not  true. 
Thus  such  a  function  as  the  function  <j>{x)  whose  graph  is  represented  by  Fig.  31 
obviously  assumes  at  least  once  every  value  between  4>(xq )  and  4>(x i):  yet  (j>(x)  is 
discontinuous.  Indeed  it  is  not  even  true  that  4>{x)  must  be  continuous  when 
it  assumes  each  value  once  and  once  only.  Thus  let  <j>(x)  be  defined  as  follows 
from  x  =  0  to  x  =  1.  If  x  =  0  let  <j)(x)  =  0;  if  0  <  x  <  1  let  4>{x)  =  1  —  x;  and 
if  x  =  1  let  4>(x)  =  1.  The  graph  of  the  function  is  shown  in  Fig.  32;  it  includes 
the  points  O,  C  but  not  the  points  A ,  B.  It  is  clear  that,  as  x  varies  from  0  to  1, 
(j>(x)  assumes  once  and  once  only  every  value  between  0(0)  =  0  and  0(1)  =  1; 
but  0(x)  is  discontinuous  for  x  =  0  and  x  =  1. 

As  a  matter  of  fact,  however,  the  curves  which  usually  occur  in  elementary 
mathematics  are  composed  of  a  finite  number  of  pieces  along  which  y  always 
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Fig.  31. 


A  C 


varies  in  the  same  direction.  It  is  easy  to  show  that  if  y  =  4>(x)  always  varies  in 
the  same  direction,  i.e.  steadily  increases  or  decreases,  as  x  varies  from  xq  to  xi, 
then  the  two  notions  of  continuity  are  really  equivalent,  i.e.  that  if  4>(x)  takes 
every  value  between  (j>(xo)  and  <j){x i)  then  it  must  be  a  continuous  function  in 
the  sense  of  §  98.  For  let  £  be  any  value  of  x  between  xq  and  x\.  As  x  — >  £ 
through  values  less  than  £,  <f)(x)  tends  to  the  limit  0(£  —  0)  (§95).  Similarly 
as  x  — »  £  through  values  greater  than  £,  <j>(x)  tends  to  the  limit  0(£  +  0).  The 
function  will  be  continuous  for  x  =  £  if  and  only  if 

<KZ-  o)  =  <K£)  =  <M£  +  o). 

But  if  either  of  these  equations  is  untrue,  say  the  first,  then  it  is  evident  that 
(j>(x)  never  assumes  any  value  which  lies  between  </>(£  —  0)  and  </>(£),  which  is 
contrary  to  our  assumption.  Thus  (j>(x)  must  be  continuous.  The  net  result  of 
this  and  the  last  section  is  consequently  to  show  that  our  common-sense  notion 
of  what  we  mean  by  continuity  is  substantially  accurate,  and  capable  of  precise 
statement  in  mathematical  terms. 


101.  In  this  and  the  following  paragraphs  we  shall  state  and  prove 
some  general  theorems  concerning  continuous  functions. 
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Theorem  1.  Suppose  that  (fix)  is  continuous  for  x  =  and  that 
(f{f)  is  positive.  Then  we  can  determine  a  positive  number  5  such  that 
(fif)  positive  throughout  the  interval  [£  —  5,  £  +  5]. 

For,  taking  e  =  ^0(0  in  the  fundamental  inequality  of  p.  212,  we  can 
choose  5  so  that 

10(0  -  0(01  <  10(0 
throughout  [£  —  S,  £  +  5],  and  then 

0(0  ^  0(0  -  10(0  -  0(01  >  10(0  >  o, 

so  that  0(0  is  positive.  There  is  plainly  a  corresponding  theorem  referring 
to  negative  values  of  0(0- 

Theorem  2.  If  (fix)  is  continuous  for  x  =  £,  and  0(0  vanishes  for 
values  of  x  as  near  to  f  as  we  please,  or  assumes,  for  values  of  x  as  near 
to  £  as  we  please,  both  positive  and  negative  values,  then  0(0  =  0. 

This  is  an  obvious  corollary  of  Theorem  1.  If  0(£)  is  not  zero,  it  must 
be  positive  or  negative;  and  if  it  were,  for  example,  positive,  it  would  be 
positive  for  all  values  of  x  sufficiently  near  to  £,  which  contradicts  the 
hypotheses  of  the  theorem. 

102.  The  range  of  values  of  a  continuous  function.  Let  us 

consider  a  function  (fix)  about  which  we  shall  only  assume  at  present  that 
it  is  defined  for  every  value  of  x  in  an  interval  [a,b\. 

The  values  assumed  by  0(x)  for  values  of  x  in  [a,  b]  form  an  aggregate  S 
to  which  we  can  apply  the  arguments  of  §  80,  as  we  applied  them  in  §  81 
to  the  aggregate  of  values  of  a  function  of  n.  If  there  is  a  number  K  such 
that  0(x)  K ,  for  all  values  of  x  in  question,  we  say  that  0(x)  is  bounded 
above.  In  this  case  (fix )  possesses  an  upper  bound  M:  no  value  of  0(x) 
exceeds  M ,  but  any  number  less  than  M  is  exceeded  by  at  least  one  value 
of  (f{x).  Similarly  we  define  ‘bounded  below1,  ‘ lower  bound1,  ‘bounded’,  as 
applied  to  functions  of  a  continuous  variable  x. 

Theorem  1.  If  (fix)  is  continuous  throughout  [a,b\,  then  it  is  bounded 
in  [a,  b\. 
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We  can  certainly  determine  an  interval  [a,  £],  extending  to  the  right 
from  a,  in  which  (j>(x)  is  bounded.  For  since  0(x)  is  continuous  for  x  =  a,  we 
can,  given  any  positive  number  e  however  small,  determine  an  interval  [a,  £] 
throughout  which  0(x)  lies  between  0(a)  —  e  and  0(a)  +  e;  and  obviously 
0(x)  is  bounded  in  this  interval. 

Now  divide  the  points  £  of  the  interval  [a,  b]  into  two  classes  L,  R, 
putting  £  in  L  if  0(£)  is  bounded  in  [a,  £],  and  in  R  if  this  is  not  the  case. 
It  follows  from  what  precedes  that  L  certainly  exists:  what  we  propose  to 
prove  is  that  R  does  not.  Suppose  that  R  does  exist,  and  let  0  be  the 
number  corresponding  to  the  section  whose  lower  and  upper  classes  are 
L  and  R.  Since  (j>{x)  is  continuous  for  x  =  0,  we  can,  however  small  e  may 
be,  determine  an  interval  [0  —  77,0  +  77]*  throughout  which 

0(0)  -  e  <  0(x)  <  0(0)  +  e. 

Thus  cp(x)  is  bounded  in  [0  —  77, 0  +  77].  Now  0  —  77  belongs  to  L.  Therefore 
(p(x)  is  bounded  in  [a,  0  —  77]:  and  therefore  it  is  bounded  in  the  whole 
interval  [a,  0  +  77] .  But  0  +  77  belongs  to  R  and  so  0(x)  is  not  bounded 
in  [a,  0  +  77].  This  contradiction  shows  that  R  does  not  exist.  And  so  0(x)  is 
bounded  in  the  whole  interval  [a,  b\. 

Theorem  2.  If  ej)(x)  is  continuous  throughout  [a,b],  and  M  and  m 
are  its  upper  and  lower  bounds,  then  0(rc)  assumes  the  values  M  and  m  at 
least  once  each  in  the  interval. 

For,  given  any  positive  number  e,  we  can  find  a  value  of  x  for  which 
M  —  <f{x)  <  e  or  1  /{M  —  <f{x)}  >  1/e.  Hence  1  /{M  —  00c)}  is  not 
bounded,  and  therefore,  by  Theorem  1,  is  not  continuous.  But  M  —  0(x) 
is  a  continuous  function,  and  so  1  /{M  —  (j>(x)}  is  continuous  at  any  point 
at  which  its  denominator  does  not  vanish  (Ex.  XXXVII.  1).  There  must 
therefore  be  one  point  at  which  the  denominator  vanishes:  at  this  point 
0(x)  =  M.  Similarly  it  may  be  shown  that  there  is  a  point  at  which 
0(x)  =  m. 

*  If  0  =  b  we  must  replace  this  interval  by  [0  —  77, 0],  and  0  +  77  by  0,  throughout  the 
argument  which  follows. 
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The  proof  just  given  is  somewhat  subtle  and  indirect,  and  it  may  be 
well,  in  view  of  the  great  importance  of  the  theorem,  to  indicate  alterna¬ 
tive  lines  of  proof.  It  will  however  be  convenient  to  postpone  these  for  a 
moment.* 

Examples  XXXVIII.  1.  If  cf>(x)  =  1/x  except  when  x  =  0,  and 
4>(x)  =  0  when  x  =  0,  then  (f>(x)  has  neither  an  upper  nor  a  lower  bound  in 
any  interval  which  includes  x  =  0  in  its  interior,  as  e.g.  the  interval  [ — 1,  H-l] . 

2.  If  4>{x)  =  1/x 2  except  when  x  =  0,  and  </>{x)  =  0  when  x  =  0,  then 
(j>(x)  has  the  lower  bound  0,  but  no  upper  bound,  in  the  interval  [ — 1,  H-l] . 

3.  Let  </>{x)  =  sin(l/x)  except  when  x  =  0,  and  (j>(x)  =  0  when  x  =  0. 
Then  (j>{x)  is  discontinuous  for  x  =  0.  In  any  interval  [— e,  +e]  the  lower  bound 
is  —1  and  the  upper  bound  +1,  and  each  of  these  values  is  assumed  by  4>(x)  an 
infinity  of  times. 

4.  Let  4>(x)  =  x  —  [x].  This  function  is  discontinuous  for  all  integral  values 
of  x.  In  the  interval  [0, 1]  its  lower  bound  is  0  and  its  upper  bound  1.  It  is  equal 
to  0  when  x  =  0  or  x  =  1,  but  it  is  never  equal  to  1.  Thus  <j>{x)  never  assumes 
a  value  equal  to  its  upper  bound. 

5.  Let  4>(x)  =  0  when  x  is  irrational,  and  (j>(x)  =  q  when  x  is  a  rational 
fraction  p/q.  Then  4>(x)  has  the  lower  bound  0,  but  no  upper  bound,  in  any 
interval  [a,b\.  But  if  4>(x)  =  (— 1  )pq  when  x  =  p/q,  then  4>(x)  has  neither  an 
upper  nor  a  lower  bound  in  any  interval. 


103.  The  oscillation  of  a  function  in  an  interval.  Let  4>(x)  be 

any  function  bounded  throughout  [a,  b ],  and  M  and  m  its  upper  and  lower 
bounds.  We  shall  now  use  the  notation  M(a,  b ),  m(a,  b)  for  M,  m,  in  order 
to  exhibit  explicitly  the  dependence  of  M  and  m  on  a  and  b ,  and  we  shall 
write 

0(a,  b)  =  M(a,  b)  —  m(a,  b). 

This  number  0(a,  b),  the  difference  between  the  upper  and  lower  bounds 
of  <p{x)  in  [a,  b],  we  shall  call  the  oscillation  o/0(x)  in  [a,  b\.  The  simplest 
of  the  properties  of  the  functions  M(a,b),  m(a,b),  0(a,b)  are  as  follows. 


See  §  104. 
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(1)  If  a  ^  c  ^  b  then  M(a,b )  is  equal  to  the  greater  of  M(a,c ) 
and  M(c,b),  and  m(a,b )  to  the  lesser  of  m(a,c)  and  m(c,b). 

(2)  M(a,b )  is  an  increasing,  m(a,b )  a  decreasing,  and  0(a,b )  an  in¬ 
creasing  function  of  b. 

(3)  0(a,b)  ^  0(a,  c)  +  0(c,b) . 

The  first  two  theorems  are  almost  immediate  consequences  of  our  def¬ 
initions.  Let  /i  be  the  greater  of  M(a,c)  and  M(c,  5),  and  let  e  be  any 
positive  number.  Then  <f>fx)  ^  ji  throughout  [a,  c]  and  [c,  b],  and  there¬ 
fore  throughout  [a,  6];  and  (p(x )  >  fi  —  e  somewhere  in  [a,  c]  or  in  [c,  b\, 
and  therefore  somewhere  in  [a,  b\.  Hence  M(a,b )  =  /i.  The  proposition 
concerning  m  may  be  proved  similarly.  Thus  (1)  is  proved,  and  (2)  is  an 
obvious  corollary. 

Suppose  now  that  Mi  is  the  greater  and  M2  the  less  of  M(a,  c ) 
and  M(c,b),  and  that  mi  is  the  less  and  m2  the  greater  of  m(a,c ) 
and  m(c,b).  Then,  since  c  belongs  to  both  intervals,  <f>(c )  is  not  greater 
than  M2  nor  less  than  m2.  Hence  M2  L;  m2,  whether  these  numbers 
correspond  to  the  same  one  of  the  intervals  [a,  c]  and  [c,b]  or  not,  and 


0(a,  b)  =  Mi  -  mi  ^  Mi  +  M2  —  mi  —  m2. 


But 


0(a,  c )  +  0(c,  b) 


and  (3)  follows. 


Mi  +  M2  -  mi  -  m2; 


104.  Alternative  proofs  of  Theorem  2  of  §  102.  The  most  straight¬ 
forward  proof  of  Theorem  2  of  §  102  is  as  follows.  Let  £  be  any  number  of  the 
interval  [a,  b\.  The  function  M(a,  £)  increases  steadily  with  £  and  never  ex¬ 
ceeds  M.  We  can  therefore  construct  a  section  of  the  numbers  £  by  putting  £ 
in  L  or  in  R  according  as  M(a,£)  <  M  or  M(a,£)  =  M.  Let  (3  be  the  number 
corresponding  to  the  section.  If  a  <  [3  <  b,  we  have 

Af(o,  P-rj)<  M,  M (a,  J3  +  t/)  =  M 
for  all  positive  values  of  g,  and  so 


M (f3  -r],P  +  rj)  =  M, 
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by  (1)  of  §  103.  Hence  f>{x)  assumes,  for  values  of  x  as  near  as  we  please  to  (3, 
values  as  near  as  we  please  to  M,  and  so,  since  <j)(x)  is  continuous,  4>((3)  must 
be  equal  to  M. 

If  /3  =  a  then  M(a,  a  +  rj)  =  M.  And  if  f3  =  b  then  M (a,  b  —  if)  <  M,  and  so 
M(b  —  i /,  b)  =  M.  In  either  case  the  argument  may  be  completed  as  before. 

The  theorem  may  also  be  proved  by  the  method  of  repeated  bisection  used 
in  §  71.  If  M  is  the  upper  bound  of  f(x )  in  an  interval  PQ,  and  PQ  is  divided 
into  two  equal  parts,  then  it  is  possible  to  find  a  half  P\Q\  in  which  the  upper 
bound  of  cj)(x )  is  also  M.  Proceeding  as  in  §  71,  we  construct  a  sequence  of 
intervals  PQ,  P\Q\,  P2Q21  ...  in  each  of  which  the  upper  bound  of  f{x )  is  M. 
These  intervals,  as  in  §  71,  converge  to  a  point  T,  and  it  is  easily  proved  that 
the  value  of  f(x )  at  this  point  is  M. 

105.  Sets  of  intervals  on  a  line.  The  Heine-Borel  Theorem. 

We  shall  now  proceed  to  prove  some  theorems  concerning  the  oscillation 
of  a  function  which  are  of  a  somewhat  abstract  character  but  of  very  great 
importance,  particularly,  as  we  shall  see  later,  in  the  theory  of  integration. 
These  theorems  depend  upon  a  general  theorem  concerning  intervals  on  a 
line. 

Suppose  that  we  are  given  a  set  of  intervals  in  a  straight  line,  that  is  to 
say  an  aggregate  each  of  whose  members  is  an  interval  [a,  /?].  We  make  no 
restriction  as  to  the  nature  of  these  intervals;  they  may  be  finite  or  infinite 
in  number;  they  may  or  may  not  overlap;*  and  any  number  of  them  may 
be  included  in  others. 

It  is  worth  while  in  passing  to  give  a  few  examples  of  sets  of  intervals  to 
which  we  shall  have  occasion  to  return  later. 

(i)  If  the  interval  [0, 1]  is  divided  into  n  equal  parts  then  the  n  intervals 
thus  formed  define  a  finite  set  of  non-overlapping  intervals  which  just  cover  up 
the  line. 

(ii)  We  take  every  point  £  of  the  interval  [0, 1],  and  associate  with  £  the 
interval  [£  —  5,  £  +  5],  where  6  is  a  positive  number  less  than  1,  except  that  with  0 
we  associate  [0,  d]  and  with  1  we  associate  [1  —  5, 1],  and  in  general  we  reject  any 

*The  word  overlap  is  used  in  its  obvious  sense:  two  intervals  overlap  if  they  have 
points  in  common  which  are  not  end  points  of  either.  Thus  [0,  |]  and  [1, 1]  overlap.  A 
pair  of  intervals  such  as  [0,  |]  and  [*,  1]  may  be  said  to  abut. 
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part  of  any  interval  which  projects  outside  the  interval  [0, 1].  We  thus  define  an 
infinite  set  of  intervals,  and  it  is  obvious  that  many  of  them  overlap  with  one 
another. 

(iii)  We  take  the  rational  points  p/q  of  the  interval  [0,1],  and  associate 
with  p/q  the  interval 

p  dp  6 
q  q3'  q  +  q3 

where  6  is  positive  and  less  than  1.  We  regard  0  as  0/1  and  1  as  1/1:  in  these 
two  cases  we  reject  the  part  of  the  interval  which  lies  outside  [0, 1].  We  obtain 
thus  an  infinite  set  of  intervals,  which  plainly  overlap  with  one  another,  since 
there  are  an  infinity  of  rational  points,  other  than  p/q,  in  the  interval  associated 
with  p/q. 

The  Heine-Borel  Theorem.  Suppose  that  we  are  given  an  inter¬ 
val  [a,  h],  and  a  set  of  intervals  I  each  of  whose  members  is  included  in  [a,  b\. 
Suppose  further  that  I  possesses  the  following  properties: 

(i)  every  point  of  [ a,b ],  other  than  a  and  b,  lies  inside*  at  least  one 
interval  of  I ; 

(ii)  a  is  the  left-hand  end  point,  and  b  the  right-hand  end  point,  of  at 
least  one  interval  of  I. 

Then  it  is  possible  to  choose  a  finite  number  of  intervals  from  the 
set  I  which  form  a  set  of  intervals  possessing  the  properties  (i)  and  (ii) . 

We  know  that  a  is  the  left-hand  end  point  of  at  least  one  interval  of  I, 
say  [a,ai].  We  know  also  that  Gq  lies  inside  at  least  one  interval  of  I, 
say  [of ,  a2] .  Similarly  eq  lies  inside  an  interval  [02,03]  of  I.  ft  is  plain  that 
this  argument  may  be  repeated  indefinitely,  unless  after  a  finite  number  of 
steps  an  coincides  with  b. 

If  an  does  coincide  with  b  after  a  finite  number  of  steps  then  there  is 
nothing  further  to  prove,  for  we  have  obtained  a  finite  set  of  intervals, 
selected  from  the  intervals  of  /,  and  possessing  the  properties  required.  If 
an  never  coincides  with  b,  then  the  points  cq,  cq,  03,  ...  must  (since  each 
lies  to  the  right  of  its  predecessor)  tend  to  a  limiting  position,  but  this 
limiting  position  may,  so  far  as  we  can  tell,  lie  anywhere  in  [a,  b]. 

*That  is  to  say  ‘in  and  not  at  an  end  of’. 
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Let  us  suppose  now  that  the  process  just  indicated,  starting  from  a,  is 
performed  in  all  possible  ways,  so  that  we  obtain  all  possible  sequences  of 
the  type  ai,  02,  03,  ....  Then  we  can  prove  that  there  must  be  at  least  one 
such  sequence  which  arrives  at  b  after  a  finite  number  of  steps. 


a  a[  ax  a'2  a2  £  a3  £'  £0  £"  h  b 


Fig.  33. 


There  are  two  possibilities  with  regard  to  any  point  £  between  a  and  b. 
Either  (i)  £  lies  to  the  left  of  some  point  an  of  some  sequence  or  (ii)  it 
does  not.  We  divide  the  points  £  into  two  classes  L  and  R  according  as  to 
whether  (i)  or  (ii)  is  true.  The  class  L  certainly  exists,  since  all  points  of 
the  interval  [a,  ai]  belong  to  L.  We  shall  now  prove  that  R  does  not  exist, 
so  that  every  point  £  belongs  to  L. 

If  R  exists  then  L  lies  entirely  to  the  left  of  R ,  and  the  classes  L,  R 
form  a  section  of  the  real  numbers  between  a  and  b,  to  which  corresponds 
a  number  £0.  The  point  £0  lies  inside  an  interval  of  /,  say  [£',£"],  and 
£'  belongs  to  L,  and  so  lies  to  the  left  of  some  term  an  of  some  sequence. 
But  then  we  can  take  [£',  £"]  as  the  interval  [a!n,  an+ 1]  associated  with  an  in 
our  construction  of  the  sequence  Gp,  a2,  a3,  . . . ;  and  all  points  to  the  left 
of  £"  lie  to  the  left  of  an+\.  There  are  therefore  points  of  L  to  the  right 
of  £0,  and  this  contradicts  the  definition  of  R.  It  is  therefore  impossible 
that  R  should  exist. 

Thus  every  point  £  belongs  to  L.  Now  b  is  the  right-hand  end  point  of 
an  interval  of  I,  say  [bi,b\,  and  b\  belongs  to  L.  Hence  there  is  a  member  an 
of  a  sequence  ai,  a2,  a3,  ...  such  that  an  >  b\.  But  then  we  may  take  the 
interval  [a'n,an+ 1]  corresponding  to  an  to  be  [61,6],  and  so  we  obtain  a 
sequence  in  which  the  term  after  the  nth  coincides  with  b,  and  therefore  a 
finite  set  of  intervals  having  the  properties  required.  Thus  the  theorem  is 
proved. 

It  is  instructive  to  consider  the  examples  of  p.  223  in  the  light  of  this  theorem. 
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(i)  Here  the  conditions  of  the  theorem  are  not  satisfied:  the  points 
1/n,  2/n,  3/n,  . . .  do  not  lie  inside  any  interval  of  I. 

(ii)  Here  the  conditions  of  the  theorem  are  satisfied.  The  set  of  intervals 

[0,25],  [5,35],  [25,45],  ...,  [1-25,1], 


associated  with  the  points  5,  25,  35,  . . . ,  1  —  5,  possesses  the  properties  required. 

(iii)  In  this  case  we  can  prove,  by  using  the  theorem,  that  there  are,  if  5  is 
small  enough,  points  of  [0, 1]  which  do  not  lie  in  any  interval  of  /. 

If  every  point  of  [0, 1]  lay  inside  an  interval  of  I  (with  the  obvious  reservation 
as  to  the  end  points),  then  we  could  find  a  finite  number  of  intervals  of  I  possess¬ 
ing  the  same  property  and  having  therefore  a  total  length  greater  than  1.  Now 
there  are  two  intervals,  of  total  length  25,  for  which  q  =  1,  and  q  —  1  intervals, 
of  total  length  25 (q  —  1  )/g3,  associated  with  any  other  value  of  q.  The  sum  of 
any  finite  number  of  intervals  of  I  can  therefore  not  be  greater  than  25  times 
that  of  the  series 


(1  2  3 

1  +  P  +  33  +  43  +  "  '  ’ 


which  will  be  shown  to  be  convergent  in  Ch.  VIII.  Hence  it  follows  that,  if  5  is 
small  enough,  the  supposition  that  every  point  of  [0, 1]  lies  inside  an  interval  of  I 
leads  to  a  contradiction. 

The  reader  may  be  tempted  to  think  that  this  proof  is  needlessly  elaborate, 
and  that  the  existence  of  points  of  the  interval,  not  in  any  interval  of  /,  follows 
at  once  from  the  fact  that  the  sum  of  all  these  intervals  is  less  than  1.  But  the 
theorem  to  which  he  would  be  appealing  is  (when  the  set  of  intervals  is  infinite) 
far  from  obvious,  and  can  only  be  proved  rigorously  by  some  such  use  of  the 
Heine-Borel  Theorem  as  is  made  in  the  text. 


106.  We  shall  now  apply  the  Heine-Borel  Theorem  to  the  proof  of  two 
important  theorems  concerning  the  oscillation  of  a  continnous  function. 

Theorem  I.  Iffifx)  is  continuous  throughout  the  interval  [a,  b\,  then  we 
can  divide  [ a,b ]  into  a  finite  number  of  sub-intervals  [a, or],  [or, £2],  •••? 
[xn,b],  in  each  of  which  the  oscillation  of  <ffx)  is  less  than  an  assigned 
positive  number  e. 

Let  f  be  any  number  between  a  and  b.  Since  fi(x)  is  continuous  for 
x  =  f,  we  can  determine  an  interval  [£  —  5,  f  +  5]  such  that  the  oscillation 
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of  <p(x)  in  this  interval  is  less  than  e.  It  is  indeed  obvious  that  there  are 
an  infinity  of  such  intervals  corresponding  to  every  £  and  every  e,  for  if 
the  condition  is  satisfied  for  any  particular  value  of  S,  then  it  is  satisfied 
a  fortiori  for  any  smaller  value.  What  values  of  <5  are  admissible  will 
naturally  depend  upon  £;  we  have  at  present  no  reason  for  supposing  that 
a  value  of  6  admissible  for  one  value  of  £  will  be  admissible  for  another. 
We  shall  call  the  intervals  thus  associated  with  £  the  e-intervals  of  £. 

If  £  =  a  then  we  can  determine  an  interval  [a,  a  +  5],  and  so  an  infinity 
of  such  intervals,  having  the  same  property.  These  we  call  the  e-intervals 
of  a ,  and  we  can  define  in  a  similar  manner  the  e-intervals  of  b. 

Consider  now  the  set  /  of  intervals  formed  by  taking  all  the  e-intervals 
of  all  points  of  [a,  b\.  It  is  plain  that  this  set  satisfies  the  conditions  of 
the  Heine-Borel  Theorem;  every  point  interior  to  the  interval  is  interior  to 
at  least  one  interval  of  /,  and  a  and  b  are  end  points  of  at  least  one  such 
interval.  We  can  therefore  determine  a  set  I'  which  is  formed  by  a  finite 
number  of  intervals  of  /,  and  which  possesses  the  same  property  as  I  itself. 

The  intervals  which  compose  the  set  I'  will  in  general  overlap  as  in 
Fig.  34.  But  their  end  points  obviously  divide  up  [a,  b]  into  a  finite  set  of 


a 


b 


Fig.  34. 

intervals  I"  each  of  which  is  included  in  an  interval  of  If  and  in  each  of 
which  the  oscillation  of  <p(x)  is  less  than  e.  Thus  Theorem  I  is  proved. 

Theorem  II.  Given  any  positive  number  e,  we  can  find  a  number  r) 
such  that,  if  the  interval  [a,  b]  is  divided  in  any  manner  into  sub-intervals 
of  length  less  than  rj,  then  the  oscillation  of  <f{x)  in  each  of  them  will  be 
less  than  e. 

Take  e±  <  \e,  and  construct,  as  in  Theorem  I,  a  finite  set  of  sub¬ 
intervals  j  in  each  of  which  the  oscillation  of  <f>(x)  is  less  than  e\.  Let  rj  be 
the  length  of  the  least  of  these  sub-intervals  j.  If  now  we  divide  [a,  b]  into 
parts  each  of  length  less  than  r/,  then  any  such  part  must  lie  entirely  within 
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at  most  two  successive  sub-intervals  j.  Hence,  in  virtue  of  (3)  of  §  103,  the 
oscillation  of  in  one  of  the  parts  of  length  less  than  r],  cannot  exceed 
twice  the  greatest  oscillation  of  <f>{x)  in  a  sub- interval  j ,  and  is  therefore 
less  than  2ei,  and  therefore  than  e. 

This  theorem  is  of  fundamental  importance  in  the  theory  of  definite 
integrals  (Ch.  VII).  It  is  impossible,  without  the  use  of  this  or  some  sim¬ 
ilar  theorem,  to  prove  that  a  function  continuous  throughout  an  interval 
necessarily  possesses  an  integral  over  that  interval. 

107.  Continuous  functions  of  several  variables.  The  notions  of 
continuity  and  discontinuity  may  be  extended  to  functions  of  several  inde¬ 
pendent  variables  (Ch.  II,  §§  31  et  seq.).  Their  application  to  such  functions 
however,  raises  questions  much  more  complicated  and  difficult  than  those 
which  we  have  considered  in  this  chapter.  It  would  be  impossible  for  us 
to  discuss  these  questions  in  any  detail  here;  but  we  shall,  in  the  sequel, 
require  to  know  what  is  meant  by  a  continuous  function  of  two  variables, 
and  we  accordingly  give  the  following  definition.  It  is  a  straightforward 
generalisation  of  the  last  form  of  the  definition  of  §  98. 

The  function  f(x,  y )  of  the  two  variables  x  and  y  is  said  to  be  contin¬ 
uous  for  x  —  £,  y  —  rj  if,  given  any  positive  number  e,  however  small,  we 
can  choose  5(e)  so  that 

1 4>{x,y)  -  <  e 

when  0  ^  \x  —  £|  ^  5(e)  and  0  ^  \y  —  rj\  V  5(e);  that  is  to  say  if  we 
can  draw  a  square,  whose  sides  are  parallel  to  the  axes  of  coordinates  and 
of  length  25(e)  ,  whose  centre  is  the  point  (£,  ??),  and  which  is  such  that  the 
value  of  <f>(x,  y)  at  any  point  inside  it  or  on  its  boundary  differs  from  0(£,  rj) 
by  less  than  e* 

This  definition  of  course  presupposes  that  cp(x,  y)  is  defined  at  all  points 
of  the  square  in  question,  and  in  particular  at  the  point  (£,77).  Another 
method  of  stating  the  definition  is  this:  (p(x,  y)  is  continuous  for  x  = 
y  —  rj  if  4>{x,  y )  — *  0(£,  rf)  when  x  — >•  £,  y  — >  rj  in  any  manner.  This  state¬ 
ment  is  apparently  simpler;  but  it  contains  phrases  the  precise  meaning  of 


The  reader  should  draw  a  figure  to  illustrate  the  definition. 
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which  has  not  yet  been  explained  and  can  only  be  explained  by  the  help 
of  inequalities  like  those  which  occur  in  our  original  statement. 

It  is  easy  to  prove  that  the  sums,  the  products,  and  in  general  the  quo¬ 
tients  of  continuous  functions  of  two  variables  are  themselves  continuous. 
A  polynomial  in  two  variables  is  continuous  for  all  values  of  the  variables; 
and  the  ordinary  functions  of  x  and  y  which  occur  in  every-day  analysis 
are  generally  continuous,  i.e.  are  continuous  except  for  pairs  of  values  of 
x  and  y  connected  by  special  relations. 

The  reader  should  observe  carefully  that  to  assert  the  continuity  of  4>{x,  y ) 
with  respect  to  the  two  variables  x  and  y  is  to  assert  much  more  than  its  con¬ 
tinuity  with  respect  to  each  variable  considered  separately.  It  is  plain  that  if 
4>(x,  y)  is  continuous  with  respect  to  x  and  y  then  it  is  certainly  continuous  with 
respect  to  x  (or  y)  when  any  fixed  value  is  assigned  to  y  (or  x) .  But  the  converse 
is  by  no  means  true.  Suppose,  for  example,  that 


4>{x,y)  = 


2  xy 

x2  +  y2 


when  neither  x  nor  y  is  zero,  and  4>(x,  y)  =  0  when  either  x  or  y  is  zero.  Then 
if  y  has  any  fixed  value,  zero  or  not,  4>(x,  y)  is  a  continuous  function  of  x,  and 
in  particular  continuous  for  x  =  0;  for  its  value  when  x  =  0  is  zero,  and  it  tends 
to  the  limit  zero  as  x  — >  0.  In  the  same  way  it  may  be  shown  that  f(x,  y)  is  a 
continuous  function  of  y.  But  4>(x,  y)  is  not  a  continuous  function  of  x  and  y 
for  x  =  0,  y  =  0.  Its  value  when  x  =  0,  y  =  0  is  zero;  but  if  x  and  y  tend  to  zero 
along  the  straight  line  y  =  ax,  then 


<t>{x,y)  = 


2  a 


l  +  a1 


lim  4>{x,  y)  = 


2  a 


l  +  a2 


which  may  have  any  value  between  —1  and  1. 


108.  Implicit  functions.  We  have  already,  in  Ch.  II,  met  with  the  idea 
of  an  implicit  function.  Thus,  if  x  and  y  are  connected  by  the  relation 


V  -xy-y-x  =  0, 


(1) 


then  y  is  an  ‘implicit  function’  of  x. 

But  it  is  far  from  obvious  that  such  an  equation  as  this  does  really  define  a 
function  y  of  x,  or  several  such  functions.  In  Ch.  II  we  were  content  to  take  this 
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for  granted.  We  are  now  in  a  position  to  consider  whether  the  assumption  we 
made  then  was  justified. 

We  shall  find  the  following  terminology  useful.  Suppose  that  it  is  possible 
to  surround  a  point  (a,  b),  as  in  §  107,  with  a  square  throughout  which  a  certain 
condition  is  satisfied.  We  shall  call  such  a  square  a  neighbourhood  of  (a,  b), 
and  say  that  the  condition  in  question  is  satisfied  in  the  neighbourhood  of  (a,  b ), 
or  near  ( a,b ),  meaning  by  this  simply  that  it  is  possible  to  find  some  square 
throughout  which  the  condition  is  satisfied.  It  is  obvious  that  similar  language 
may  be  used  when  we  are  dealing  with  a  single  variable,  the  square  being  replaced 
by  an  interval  on  a  line. 

Theorem.  If  (i)  /(x,  y )  is  a  continuous  function  of  x  and  y  in  the  neigh¬ 

bourhood  of  (a,  b), 

(ii)  f(a,b)  =  0, 

(iii)  f(x,y)  is,  for  all  values  of  x  in  the  neighbourhood  of  a,  a  steadily 
increasing  function  of  y,  in  the  stricter  sense  o/§  95, 

then  (1)  there  is  a  unique  function  y  =  4>(x)  which,  when  substituted  in  the 
equation  f(x,y)  =  0,  satisfies  it  identically  for  all  values  of  x  in  the  neighbour¬ 
hood  of  a, 

(2)  4>(x)  is  continuous  for  all  values  of  x  in  the  neighbourhood  of  a. 

In  the  figure  the  square  represents  a  ‘neighbourhood’  of  (a,  b )  throughout 
which  the  conditions  (i)  and  (iii)  are  satisfied,  and  P  the  point  (a,  b).  If  we  take 


Q  and  R  as  in  the  figure,  it  follows  from  (iii)  that  f(x,y)  is  positive  at  Q  and 
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negative  at  R.  This  being  so,  and  f(x,y)  being  continuous  at  Q  and  at  R,  we 
can  draw  lines  QQ'  and  RR'  parallel  to  OX ,  so  that  R'Q'  is  parallel  to  OY  and 
f(x,y)  is  positive  at  all  points  of  QQ'  and  negative  at  all  points  of  RR' .  In 
particular  f(x,y)  is  positive  at  Q'  and  negative  at  R' ,  and  therefore,  in  virtue 
of  (iii)  and  §  100,  vanishes  once  and  only  once  at  a  point  P'  on  R'Q' .  The 
same  construction  gives  us  a  unique  point  at  which  f(x,y )  =  0  on  each  ordinate 
between  RQ  and  R'Q' .  It  is  obvious,  moreover,  that  the  same  construction  can 
be  carried  out  to  the  left  of  RQ.  The  aggregate  of  points  such  as  P'  gives  us  the 
graph  of  the  required  function  y  =  <f>(x). 

It  remains  to  prove  that  <f>(x)  is  continuous.  This  is  most  simply  effected  by 
using  the  idea  of  the  ‘limits  of  indetermination’  of  <p(x)  as  x  — >  a  (§  96).  Suppose 
that  x  — >  a,  and  let  A  and  A  be  the  limits  of  indetermination  of  4>{x)  as  x  — >  a. 
It  is  evident  that  the  points  (a,  A)  and  (a,  A)  lie  on  QR.  Moreover,  we  can  find 
a  sequence  of  values  of  x  such  that  cj>{x)  — >  A  when  x  — >  a  through  the  values  of 
the  sequence;  and  since  f{x,(j)(x)}  =  0,  and  f(x,  y )  is  a  continuous  function  of 
x  and  y,  we  have 

/(a,  A)  =  0. 

Hence  A  =  6;  and  similarly  A  =  b.  Thus  4>(x)  tends  to  the  limit  b  as  x  — >  a,  and 
so  (j)(x)  is  continuous  for  x  =  a.  It  is  evident  that  we  can  show  in  exactly  the 
same  way  that  < j)(x)  is  continuous  for  any  value  of  x  in  the  neighbourhood  of  a. 

It  is  clear  that  the  truth  of  the  theorem  would  not  be  affected  if  we  were  to 
change  ‘increasing’  to  ‘decreasing’  in  condition  (iii). 

As  an  example,  let  us  consider  the  equation  (1),  taking  a  =  0,  b  =  0.  It  is 
evident  that  the  conditions  (i)  and  (ii)  are  satisfied.  Moreover 

f(x,  y )  -  f(x,  y')  =  {y-  y')(y4  +  y3y'  +  y2y'2  +  yy '3  +  y4  -  x  -  1) 

has,  when  x,  y,  and  y'  are  sufficiently  small,  the  sign  opposite  to  that  of  y  —  y' . 
Hence  condition  (iii)  (with  ‘decreasing’  for  ‘increasing’)  is  satisfied.  It  follows 
that  there  is  one  and  only  one  continuous  function  y  which  satisfies  the  equa¬ 
tion  (1)  identically  and  vanishes  with  x. 

The  same  conclusion  would  follow  if  the  equation  were 

y2  —  xy  —  y  —  x  =  0. 

The  function  in  question  is  in  this  case 

y  =  \{l  +  x  -  y/l  +  6x  +  x2}, 
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where  the  square  root  is  positive.  The  second  root,  in  which  the  sign  of  the 
square  root  is  changed,  does  not  satisfy  the  condition  of  vanishing  with  x. 

There  is  one  point  in  the  proof  which  the  reader  should  be  careful  to  observe. 
We  supposed  that  the  hypotheses  of  the  theorem  were  satisfied  ‘in  the  neigh¬ 
bourhood  of  (a,  b)\  that  is  to  say  throughout  a  certain  square  £  —  8  ^  x  ^  +  5, 

rj  —  5  ^  y  ^  r]  +  5.  The  conclusion  holds  ‘in  the  neighbourhood  of  x  =  a\  that 
is  to  say  throughout  a  certain  interval  £  —  <5i  =  x  ^  £  +  5\ .  There  is  nothing  to 
show  that  the  <5i  of  the  conclusion  is  the  5  of  the  hypotheses,  and  indeed  this  is 
generally  untrue. 


109.  Inverse  Functions.  Suppose  in  particular  that  f(x,y)  is  of  the 
form  F(y)  —  x.  We  then  obtain  the  following  theorem. 

If  F(y)  is  a  function  ofy,  continuous  and  steadily  increasing  ( or  decreasing ), 
in  the  stricter  sense  of  %  95,  in  the  neighbourhood  of  y  =  b,  and  F(b)  =  a,  then 
there  is  a  unique  continuous  function  y  =  <f(x)  which  is  equal  to  b  when  x  =  a 
and  satisfies  the  equation  F(y)  =  x  identically  in  the  neighbourhood  of  x  =  a. 

The  function  thus  defined  is  called  the  inverse  function  of  F(y). 

Suppose  for  example  that  y3  =  x,  a  =  0,  b  =  0.  Then  all  the  conditions  of 
the  theorem  are  satisfied.  The  inverse  function  is  x  =  ^fy. 

If  we  had  supposed  that  y2  =  x  then  the  conditions  of  the  theorem  would 
not  have  been  satisfied,  for  y2  is  not  a  steadily  increasing  function  of  y  in  any 
interval  which  includes  y  =  0:  it  decreases  when  y  is  negative  and  increases  when 
y  is  positive.  And  in  this  case  the  conclusion  of  the  theorem  does  not  hold,  for 
y2  =  x  defines  two  functions  of  x,  viz.  y  =  sjx  and  y  =  —y/x,  both  of  which 
vanish  when  x  =  0,  and  each  of  which  is  defined  only  for  positive  values  of  x,  so 
that  the  equation  has  sometimes  two  solutions  and  sometimes  none.  The  reader 
should  consider  the  more  general  equations 


y2n  =  x , 


y2n+1  =  x, 


in  the  same  way.  Another  interesting  example  is  given  by  the  equation 

y5  ~  y  ~  x  =  0, 


already  considered  in  Ex.  xiv.  7. 
Similarly  the  equation 


sm  y  =  x 
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has  just  one  solution  which  vanishes  with  x,  viz.  the  value  of  arc  sin  x  which 
vanishes  with  x.  There  are  of  course  an  infinity  of  solutions,  given  by  the  other 
values  of  arc  sin  x  (cf.  Ex.  XV.  10),  which  do  not  satisfy  this  condition. 

So  far  we  have  considered  only  what  happens  in  the  neighbourhood  of  a 
particular  value  of  x.  Let  us  suppose  now  that  F(y)  is  positive  and  steadily  in¬ 
creasing  (or  decreasing)  throughout  an  interval  [a,  b ].  Given  any  point  £  of  [a,  b], 
we  can  determine  an  interval  i  including  £,  and  a  unique  and  continuous  inverse 
function  (j>i(x)  defined  throughout  i. 

From  the  set  I  of  intervals  i  we  can,  in  virtue  of  the  Heine-Borel  Theorem, 
pick  out  a  finite  sub-set  covering  up  the  whole  interval  [a,  b] ;  and  it  is  plain 
that  the  finite  set  of  functions  4>i(x),  corresponding  to  the  sub-set  of  intervals  i 
thus  selected,  define  together  a  unique  inverse  function  <f(x)  continuous  through¬ 
out  [a,  b\. 

We  thus  obtain  the  theorem:  if  x  =  F(y),  where  F{y )  is  continuous  and 
increases  steadily  and  strictly  from  A  to  B  as  x  increases  from  a  to  b,  then  there 
is  a  unique  inverse  function  y  =  <f{x)  which  is  continuous  and  increases  steadily 
and  strictly  from  a  to  b  as  x  increases  from  A  to  B. 

It  is  worth  while  to  show  how  this  theorem  can  be  obtained  directly  without 
the  help  of  the  more  difficult  theorem  of  §  108.  Suppose  that  A  <  £  <  B,  and 
consider  the  class  of  values  of  y  such  that  (i )  a  <  y  <  b  and  (ii)  F(y)  ^  £.  This 
class  has  an  upper  bound  77,  and  plainly  F(rj)  £.  If  F(y)  were  less  than  £,  we 
could  find  a  value  of  y  such  that  y  >  r\  and  F(y)  <  £,  and  77  would  not  be  the 
upper  bound  of  the  class  considered.  Hence  F(rj)  =  £.  The  equation  F{y)  =  £ 
has  therefore  a  unique  solution  y  =  77  =  <^(£),  say;  and  plainly  77  increases 
steadily  and  continuously  with  £,  which  proves  the  theorem. 

MISCELLANEOUS  EXAMPLES  ON  CHAPTER  V. 

1.  Show  that,  if  neither  a  nor  b  is  zero,  then 

axn  +  bxn~l  +  ■  ■  ■  +  k  =  axn(  1  +  ex), 

where  ex  is  of  the  first  order  of  smallness  when  x  is  large. 

2.  If  P(x)  =  axn  +  bx n~1  +  •■■  +  &,  and  a  is  not  zero,  then  as  x  increases 
P(x)  has  ultimately  the  sign  of  a;  and  so  has  P{x  +  A)  —  P(x),  where  A  is  any 
constant. 
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3.  Show  that  in  general 

(axn  +  bxn~1  +  •  •  •  +  k)/(Axn  +  Bxn~l  +  ■  ■  ■  +  K)  =  a  +  (/3/x)(  1  +  ex), 

where  a  =  a/ A,  (3  =  (bA  —  aB) /A2 ,  and  ex  is  of  the  first  order  of  smallness  when 
x  is  large.  Indicate  any  exceptional  cases. 

4.  Express 

(ax2  +  bx  +  c)/(Ax 2  +  Bx  +  C ) 

in  the  form 

a  +  (/3/x)  +  (^/x2)(l  +  ex), 

where  ex  is  of  the  first  order  of  smallness  when  x  is  large. 

5.  Show  that 

lim  y/x{y/x  +  a  —  y/x}  =  io. 

[Use  the  formula  y/x  +  a  —  y/x  =  a/{y/x  +  a  +  yfx }.] 

6.  Show  that  y/x  +  a  =  yfx  A  \  (a/ y/x)  (1  +  ex),  where  ex  is  of  the  first  order 
of  smallness  when  x  is  large. 

7.  Find  values  of  a  and  (3  such  that  yj ax2  +  2 bx  +  c  —  ax  —  (3  has  the  limit 
zero  as  x  — >  oo;  and  prove  that  lim  x{  yj  ax 2  +  2  bx  +  c  —  ax  —  (3}  =  ( ac  —  b2)/2a. 

8.  Evaluate 

lim  x  <  y  x2  +  V x4  +  1  —  xV2 

9.  Prove  that  (sec  a:  —  tan.x)  — >•  0  as  x  — > 

10.  Prove  that  4>(x)  =  1  —  cos(l  —  cosx)  is  of  the  fourth  order  of  smallness 
when  x  is  small;  and  find  the  limit  of  (f)(x)/xA  as  x  — >  0. 

11.  Prove  that  4>(x)  =  xsin(sin.x)  —  sin2  x  is  of  the  sixth  order  of  smallness 
when  x  is  small;  and  find  the  limit  of  (j)(x)/xe  as  x  — >  0. 

12.  From  a  point  P  on  a  radius  OA  of  a  circle,  produced  beyond  the  cir¬ 
cle,  a  tangent  PT  is  drawn  to  the  circle,  touching  it  in  T,  and  TN  is  drawn 
perpendicular  to  OA.  Show  that  NA/AP  — >  1  as  P  moves  up  to  A. 

13.  Tangents  are  drawn  to  a  circular  arc  at  its  middle  point  and  its  ex¬ 
tremities;  A  is  the  area  of  the  triangle  formed  by  the  chord  of  the  arc  and  the 
two  tangents  at  the  extremities,  and  A'  the  area  of  that  formed  by  the  three 
tangents.  Show  that  A/ A'  — >  4  as  the  length  of  the  arc  tends  to  zero. 
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14.  For  what  values  of  a  does  {a  +  sin(l/x)}/x  tend  to  (1)  oo,  (2)  —  oo,  as 
x  — y  0?  [To  oo  if  a  >  1,  to  — oo  if  a  <  —1:  the  function  oscillates  if  —1  ^  a  ^  1.] 

15.  If  (f)(x )  =  1/q  when  x  =  p/q,  and  4>(x)  =  0  when  x  is  irrational,  then 
( p(x )  is  continuous  for  all  irrational  and  discontinuous  for  all  rational  values  of  x. 

16.  Show  that  the  function  whose  graph  is  drawn  in  Fig.  32  may  be  repre¬ 
sented  by  either  of  the  formulae 

1  —  x  +  [x]  —  [1  —  x],  1  —  x  —  lim  (cos2n+1  ttx). 

n— >oo 

17.  Show  that  the  function  fi(x)  which  is  equal  to  0  when  x  =  0,  to  \  —  x 
when  0  <  x  <  to  \  when  x  =  to  |  —  x  when  1  <  x  <  1,  and  to  1  when 
x  =  1,  assumes  every  value  between  0  and  1  once  and  once  only  as  x  increases 
from  0  to  1,  but  is  discontinuous  for  x  =  0,  x  =  and  x  =  1.  Show  also  that 
the  function  may  be  represented  by  the  formula 

I  -x- \[2x\  -  §[1-2®]. 

18.  Let  <f>(x)  =  x  when  x  is  rational  and  <j>(x)  =  1  —  x  when  x  is  irrational. 
Show  that  (j>(x)  assumes  every  value  between  0  and  1  once  and  once  only  as 
x  increases  from  0  to  1,  but  is  discontinuous  for  every  value  of  x  except  x  =\. 

19.  As  x  increases  from  —  jvr  to  \tt,  y  =  sins  is  continuous  and  steadily 
increases,  in  the  stricter  sense,  from  —1  to  1.  Deduce  the  existence  of  a  function 
x  =  arc  sin  y  which  is  a  continuous  and  steadily  increasing  function  of  y  from 
y  =  -1  to  y  =  1. 

20.  Show  that  the  numerically  least  value  of  arc  tan  y  is  continuous  for  all 
values  of  y  and  increases  steadily  from  —  ^vr  to  }j7r  as  y  varies  through  all  real 
values. 

21.  Discuss,  on  the  lines  of  §§  108-109,  the  solution  of  the  equations 

y  —  y  —  x  =  0,  y  -y  -x  =  0,  y  -y  +X  =  0 
in  the  neighbourhood  of  x  =  0,  y  =  0. 

22.  If  ax 2  +  2 bxy  +  cy 2  +  2 dx  +  2 ey  =  0  and  A  =  2 bde  —  ae 2  —  cd2,  then  one 
value  of  y  is  given  by  y  =  ax  +  fix2  +  (7  +  ex)x 3,  where 


a  =  — d/e,  fi  =  A/2e3,  7  =  (cd  —  6e)A/2e5, 
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and  ex  is  of  the  first  order  of  smallness  when  x  is  small. 

[If  y  —  ax  =  77  then 

—2  erj  =  ax2  +  2  bx(r)  +  ax)  +  c(rj  +  ax)2  =  Ax2  +  2Bxr]  +  Crj2 , 

say.  It  is  evident  that  77  is  of  the  second  order  of  smallness,  xr)  of  the  third,  and 
rj2  of  the  fourth;  and  —2 erj  =  Ax 2  —  (AB /e)x3 ,  the  error  being  of  the  fourth 
order.] 

23.  If  x  =  ay  +  by2  +  cy3  then  one  value  of  y  is  given  by 

y  =  ax  +  fix2  +  (7  +  ex)x 3, 

where  a  =  1/a,  fi  =  —b/a3,  7  =  ( 2b 2  —  ac)/a5,  and  ex  is  of  the  first  order  of 
smallness  when  x  is  small. 

24.  If  x  =  ay  +  byn,  where  n  is  an  integer  greater  than  unity,  then  one  value 
of  y  is  given  by  y  =  ax  +  fixn  +  (7  +  ex)x2n~1,  where  a  =  l/a,  fi  =  — b/an+1 , 
7  =  nb2 / a2n+l ,  and  ex  is  of  the  (n  —  l)th  order  of  smallness  when  x  is  small. 

25.  Show  that  the  least  positive  root  of  the  equation  xy  =  sin  a;  is  a  con¬ 
tinuous  function  of  y  throughout  the  interval  [0, 1],  and  decreases  steadily  from 
7r  to  0  as  y  increases  from  0  to  1.  [The  function  is  the  inverse  of  (sin  x) /x: 
apply  §  109.] 

26.  The  least  positive  root  of  xy  =  tanx  is  a  continuous  function  of  y 
throughout  the  interval  [1, 00),  and  increases  steadily  from  0  to  ^ir  as  y  increases 
from  1  towards  00. 


CHAPTER  VI 


DERIVATIVES  AND  INTEGRALS 

110.  Derivatives  or  Differential  Coefficients.  Let  us  return  to 
the  consideration  of  the  properties  which  we  naturally  associate  with  the 
notion  of  a  curve.  The  first  and  most  obvious  property  is,  as  we  saw  in 
the  last  chapter,  that  which  gives  a  curve  its  appearance  of  connectedness, 
and  which  we  embodied  in  our  definition  of  a  continuous  function. 

The  ordinary  curves  which  occur  in  elementary  geometry,  such  as 
straight  lines,  circles  and  conic  sections,  have  of  course  many  other  prop¬ 
erties  of  a  general  character.  The  simplest  and  most  noteworthy  of  these 
is  perhaps  that  they  have  a  definite  direction  at  every  point,  or  what  is 
the  same  thing,  that  at  every  point  of  the  curve  we  can  draw  a  tangent 
to  it.  The  reader  will  probably  remember  that  in  elementary  geometry 
the  tangent  to  a  curve  at  P  is  defined  to  be  ‘the  limiting  position  of  the 
chord  PQ,  when  Q  moves  up  towards  coincidence  with  P\  Let  us  consider 
what  is  implied  in  the  assumption  of  the  existence  of  such  a  limiting 
position. 

In  the  figure  (Fig.  36)  P  is  a  fixed  point  on  the  curve,  and  Q  a  variable 
point;  PM,  QN  are  parallel  to  OY  and  PR  to  OX.  We  denote  the  coor¬ 
dinates  of  P  by  x,  y  and  those  of  Q  by  x  +  h,  y  +  k:  h  will  be  positive  or 
negative  according  as  N  lies  to  the  right  or  left  of  M. 

We  have  assumed  that  there  is  a  tangent  to  the  curve  at  P,  or  that 
there  is  a  definite  ‘limiting  position’  of  the  chord  PQ.  Suppose  that  PT, 
the  tangent  at  P,  makes  an  angle  0  with  OX.  Then  to  say  that  PT  is  the 
limiting  position  of  PQ  is  equivalent  to  saying  that  the  limit  of  the  angle 
QPR  is  0,  when  Q  approaches  P  along  the  curve  from  either  side.  We 
have  now  to  distinguish  two  cases,  a  general  case  and  an  exceptional  one. 

The  general  case  is  that  in  which  -0  is  not  equal  to  so  that  PT  is 
not  parallel  to  OY .  In  this  case  RPQ  tends  to  the  limit  0,  and 

RQj PR  =  tan  RPQ 
tends  to  the  limit  tan0.  Now 

RQ/PR  =  ( NQ  -  MP)/MN  =  {0(x  +  h)  -  <j>{x)}/h\ 
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Fig.  36. 


and  so 


(1) 


h — >0  h 


The  reader  should  be  careful  to  note  that  in  all  these  equations  all 
lengths  are  regarded  as  affected  with  the  proper  sign,  so  that  ( e.g .)  RQ  is 
negative  in  the  figure  when  Q  lies  to  the  left  of  P;  and  that  the  convergence 
to  the  limit  is  unaffected  by  the  sign  of  h. 

Thus  the  assumption  that  the  curve  which  is  the  graph  of  4>(x)  has  a 
tangent  at  P,  which  is  not  perpendicular  to  the  axis  of  x,  implies  that 
c t>(x )  has,  for  the  particular  value  of  x  corresponding  to  P,  the  property 
that  {(p(x  +  h )  —  cp(x)}/h  tends  to  a  limit  when  h  tends  to  zero. 

This  of  course  implies  that  both  of 


{cj)(x  +  h)-  4>(x)}/h,  {4>(x  -  h)  -  4>{x)}/(-h) 


tend  to  limits  when  h  — >  0  by  positive  values  only,  and  that  the  two  limits  are 
equal.  If  these  limits  exist  but  are  not  equal,  then  the  curve  y  =  d>(x)  has  an 
angle  at  the  particular  point  considered,  as  in  Fig.  37. 

Now  let  us  suppose  that  the  curve  has  (like  the  circle  or  ellipse)  a 
tangent  at  every  point  of  its  length,  or  at  any  rate  every  portion  of  its 
length  which  corresponds  to  a  certain  range  of  variation  of  x.  Further  let 
us  suppose  this  tangent  never  perpendicular  to  the  axis  of  x:  in  the  case 
of  a  circle  this  would  of  course  restrict  us  to  considering  an  arc  less  than 
a  semicircle.  Then  an  equation  such  as  (1)  holds  for  all  values  of  x  which 
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fall  inside  this  range.  To  each  such  value  of  x  corresponds  a  value  of  tan  if: 
tan  if  is  a  function  of  x,  which  is  defined  for  all  values  of  x  in  the  range  of 
values  under  consideration,  and  which  may  be  calculated  or  derived  from 
the  original  function  <f{x).  We  shall  call  this  function  the  derivative  or 
derived  function  of  (f(x),  and  we  shall  denote  it  by 


Another  name  for  the  derived  function  of  <f(x)  is  the  differential  co¬ 
efficient  of  (j>{x)\  and  the  operation  of  calculating  (f'{x)  from  (f>{x)  is  gen¬ 
erally  known  as  differentiation.  This  terminology  is  firmly  established 
for  historical  reasons:  see  §  115. 

Before  we  proceed  to  consider  the  special  case  mentioned  above,  in 
which  if  =  |7 r,  we  shall  illustrate  our  definition  by  some  general  remarks 
and  particular  illustrations. 

111.  Some  general  remarks.  (1)  The  existence  of  a  derived  func¬ 
tion  f>'{x)  for  all  values  of  x  in  the  interval  a  ^  x  5s  b  implies  that 
<f{x)  is  continuous  at  every  point  of  this  interval.  For  it  is  evident  that 
{0(x  +  h)  —  <f(x)}/h  cannot  tend  to  a  limit  unless  lim  <f(x  +  h)  =  <f(x),  and 
it  is  this  which  is  the  property  denoted  by  continuity. 

(2)  It  is  natural  to  ask  whether  the  converse  is  true,  i.e.  whether  every 
continuous  curve  has  a  definite  tangent  at  every  point,  and  every  function 


Fig.  37. 

a  differential  coefficient  for  every  value  of  x  for  which  it  is  continuous.*  The 


We  leave  out  of  account  the  exceptional  case  (which  we  have  still  to  examine)  in 
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answer  is  obviously  No:  it  is  sufficient  to  consider  the  curve  formed  by  two 
straight  lines  meeting  to  form  an  angle  (Fig.  37).  The  reader  will  see  at 
once  that  in  this  case  {4>(x  +  h)  —  (f>(x)}/h  has  the  limit  tan  /3  when  h  — >  0 
by  positive  values  and  the  limit  tan  a  when  h  — >  0  by  negative  values. 

This  is  of  course  a  case  in  which  a  curve  might  reasonably  be  said  to  have  two 
directions  at  a  point.  But  the  following  example,  although  a  little  more  difficult, 
shows  conclusively  that  there  are  cases  in  which  a  continuous  curve  cannot  be 
said  to  have  either  one  direction  or  several  directions  at  one  of  its  points.  Draw 
the  graph  (Fig.  14,  p.  61)  of  the  function  xsin(l/x).  The  function  is  not  defined 
for  x  =  0,  and  so  is  discontinuous  for  x  =  0.  On  the  other  hand  the  function 
defined  by  the  equations 


<f>(x)  =  xsin(l/x)  (i  /  0),  <f(x)  =  0  (x  =  0) 


is  continuous  for  x  =  0  (Exs.  xxxvn.  14,  15),  and  the  graph  of  this  function  is 
a  continuous  curve. 

But  <f>(x)  has  no  derivative  for  x  =  0.  For  <j/(0)  would  be,  by  definition, 
lim{<j>(/t)  —  (f)(0)} /h  or  lirri sin(l//i);  and  no  such  limit  exists. 

It  has  even  been  shown  that  a  function  of  x  may  be  continuous  and  yet  have 
no  derivative  for  any  value  of  x,  but  the  proof  of  this  is  much  more  difficult. 
The  reader  who  is  interested  in  the  question  may  be  referred  to  Bromwich’s 
Infinite  Series ,  pp.  490-1,  or  Hobson’s  Theory  of  Functions  of  a  Real  Variable , 
pp.  620-5. 


(3)  The  notion  of  a  derivative  or  differential  coefficient  was  suggested 
to  us  by  geometrical  considerations.  But  there  is  nothing  geometrical  in 
the  notion  itself.  The  derivative  <f>'(x)  of  a  function  <f(x)  may  be  defined, 
without  any  reference  to  any  kind  of  geometrical  representation  of  (f>(x), 
by  the  equation 


(f'(x) 


lim 

h-+  o 


(f){x  +  h)  —  (p(x) 
h  ’ 


and  <f)(x)  has  or  has  not  a  derivative,  for  any  particular  value  of  x ,  according 
as  this  limit  does  or  does  not  exist.  The  geometry  of  curves  is  merely  one 


which  the  curve  is  supposed  to  have  a  tangent  perpendicular  to  OX:  apart  from  this 
possibility  the  two  forms  of  the  question  stated  above  are  equivalent. 
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of  many  departments  of  mathematics  in  which  the  idea  of  a  derivative  finds 
an  application. 

Another  important  application  is  in  dynamics.  Suppose  that  a  particle  is 
moving  in  a  straight  line  in  such  a  way  that  at  time  t  its  distance  from  a  fixed 
point  on  the  line  is  s  =  4>(t).  Then  the  ‘velocity  of  the  particle  at  time  V  is  by 
definition  the  limit  of 

4>{t  +  h)  -  4>(t) 

as  h  — >  0.  The  notion  of  ‘velocity’  is  in  fact  merely  a  special  case  of  that  of  the 
derivative  of  a  function. 


Examples  XXXIX.  1.  If  cf>(x)  is  a  constant  then  (ft  {x)  =  0.  Interpret 
this  result  geometrically. 

2.  If  <f)(x)  =  ax  +  b  then  c ft(x )  =  a.  Prove  this  (i)  from  the  formal  definition 
and  (ii)  by  geometrical  considerations. 

3.  If  4>(x)  =  xm,  where  rn  is  a  positive  integer,  then  (f/(x)  =  rnxm~l . 

[For 


(ft  (x)  =  lim 


(x  +  h)m  -  xr 
h 


=  lim  {  mx™-1  +  — l^-xm~2h  +  •  •  •  +  h171-1 

'  1-2 


The  reader  should  observe  that  this  method  cannot  be  applied  to  xp/q,  where 
p/q  is  a  rational  fraction,  as  we  have  no  means  of  expressing  (x  +  h)p/q  as  a 
finite  series  of  powers  of  h.  We  shall  show  later  on  (§  118)  that  the  result  of 
this  example  holds  for  all  rational  values  of  m.  Meanwhile  the  reader  will  find  it 
instructive  to  determine  4>'(x)  when  m  has  some  special  fractional  value  ( e.g .  |), 
by  means  of  some  special  device.] 

4.  If  4>{x)  =  sin  x,  then  (/>' (x)  =  cosx;  and  if  (j>(x)  =  cosx,  then 
(ft  (x)  =  —  sinx. 

[For  example,  if  <fr(x)  =  sinx,  we  have 

{4>(x  +  h)  —  (j)(x)}/h  =  {2  sin  | h,  cos(x  +  \h)}/h, 

the  limit  of  which,  when  h  — >  0,  is  cosx,  since  limcos(x  +  \h)  =  cosx  (the  cosine 
being  a  continuous  function)  and  lim{(sin \h)/\h}  =  1  (Ex.  xxxvi.  13).] 
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5.  Equations  of  the  tangent  and  normal  to  a  curve  y  =  4>(x).  The 
tangent  to  the  curve  at  the  point  (xq,  yo)  is  the  line  through  (xo,  yo)  which  makes 
with  OX  an  angle  -0,  where  tan =  cj)'(xo).  Its  equation  is  therefore 

y-yo  =  (x-  x0)<^(x0); 

and  the  equation  of  the  normal  (the  perpendicular  to  the  tangent  at  the  point 
of  contact)  is 

(y  ~  Vo)(t>\xo)  +  x  -  x0  =  0. 

We  have  assumed  that  the  tangent  is  not  parallel  to  the  axis  of  y.  In  this  special 
case  it  is  obvious  that  the  tangent  and  normal  are  x  =  xo  and  y  =  yo  respectively. 

6.  Write  down  the  equations  of  the  tangent  and  normal  at  any  point  of  the 
parabola  x 2  =  4 ay.  Show  that  if  xo  =  2a/ m,  yo  =  a/m2,  then  the  tangent  at 
(x0,  yo)  is  x  =  my  +  (a/m). 


112.  We  have  seen  that  if  <f>(x)  is  not  continuous  for  a  value  of  x  then  it 
cannot  possibly  have  a  derivative  for  that  value  of  x.  Thus  such  functions 
as  1/x  or  sin(l/x),  which  are  not  defined  for  x  —  0,  and  so  necessarily 
discontinuous  for  x  —  0,  cannot  have  derivatives  for  x  —  0.  Or  again  the 
function  [x],  which  is  discontinuous  for  every  integral  value  of  x,  has  no 
derivative  for  any  such  value  of  x. 


Example.  Since  [t]  is  constant  between  every  two  integral  values  of  x,  its 
derivative,  whenever  it  exists,  has  the  value  zero.  Thus  the  derivative  of  [x], 
which  we  may  represent  by  [x]7,  is  a  function  equal  to  zero  for  all  values  of  x 

save  integral  values  and  undefined  for  integral  values.  It  is  interesting  to  note 

,  .  .  ,  simx  ,  ,  , 

that  the  function  1 - has  exactly  the  same  properties. 

sin  7 rx 

We  saw  also  in  Ex.  XXXVII.  7  that  the  types  of  discontinuity  which  occur 
most  commonly,  when  we  are  dealing  with  the  very  simplest  and  most  ob¬ 
vious  kinds  of  functions,  such  as  polynomials  or  rational  or  trigonometrical 
functions,  are  associated  with  a  relation  of  the  type 


4>(x)  -9-  Too 
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Fig.  38. 

or  <f(x)  — >  —  oo.  In  all  these  cases,  as  in  such  cases  as  those  considered 
above,  there  is  no  derivative  for  certain  special  values  of  x.  In  fact,  as  was 
pointed  out  in  §  111,  (1),  all  discontinuities  of  cf{x)  are  also  discontinuities 
of  (f>'{x).  But  the  converse  is  not  true,  as  we  may  easily  see  if  we  return  to 
the  geometrical  point  of  view  of  §  110  and  consider  the  special  case,  hitherto 
left  aside,  in  which  the  graph  of  <f>(x)  has  a  tangent  parallel  to  OY.  This 
case  may  be  subdivided  into  a  number  of  cases,  of  which  the  most  typical 
are  shown  in  Fig.  38.  In  cases  (c)  and  (d)  the  function  is  two  valued  on 
one  side  of  P  and  not  defined  on  the  other.  In  such  cases  we  may  consider 
the  two  sets  of  values  of  4>(x),  which  occur  on  one  side  of  P  or  the  other, 
as  defining  distinct  functions  <j>\(x)  and  02 (x),  the  upper  part  of  the  curve 
corresponding  to  (j>\{x). 

The  reader  will  easily  convince  himself  that  in  (a) 

{4>(x  T  h)  —  (p(x)}/h  — »  Too, 

as  h  — *  0,  and  in  ( b ) 

{4>{;x  T  h)  —  4>(x)}/h  — *  — oo; 

while  in  (c) 

{<fi{x  T  h)  —  (x)}/h  -A  Too,  {f>2(x  T  h)  —  (j)2(x)}/h  — >■  -oo, 
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and  in  (  d) 

{0i (x  +  h )  -  0i(x)}//i  -A  — oo,  {02(^  +  h)  —  02(:r)}//i  — >■  +oo, 

though  of  course  in  (c)  only  positive  and  in  (d)  only  negative  values  of  h 
can  be  considered,  a  fact  which  by  itself  would  preclude  the  existence  of  a 
derivative. 

We  can  obtain  examples  of  these  four  cases  by  considering  the  functions 
defined  by  the  equations 

(a)  y 3  =  x,  ( b )  y3  =  - x ,  (c)  y 2  =  x,  ( d )  y2  =  -x, 
the  special  value  of  x  under  consideration  being  x  —  0. 


113.  Some  general  rules  for  differentiation.  Throughout  the 
theorems  which  follow  we  assume  that  the  functions  f(x)  and  F(x)  have 
derivatives  f'(x)  and  F\x)  for  the  values  of  x  considered. 

(1)  If  4>(x)  =  f(x)  +  F(x),  then  <j>(x)  has  a  derivative 

(j>'(x)  =  f'(x)  +  F'{x). 


(2)  If  4>(x)  =  kf(x),  where  k  is  a  constant,  then  (j>(x)  has  a  derivative 

<t>\x)  =  kf\x ). 

We  leave  it  as  an  exercise  to  the  reader  to  deduce  these  results  from 
the  general  theorems  stated  in  Ex.  XXXV.  1. 

(3)  If  <f>(x)  =  f(x)F(x),  then  <f>(x)  has  a  derivative 

4>'(x)  =  f(x)F'(x)  +  f\x)F(x). 


For 


4>'(x)  =  lim 


f(x  +  h)F(x  +  h )  —  f(x)F(x) 


lim  < 


h 


F(x  +  h)  -  F(x)  f(x  +  h)  —  f(x) 

—  lim  {  f(x  +  h) — - - l - ^  +  F(x)—  ’  w 


h 


h 


=  f(x)F'(x)  +  F(x)f'(x). 
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(4)  If  (j>{x)  = 


f(x) 


then  <j)(x)  has  a  derivative 


(x  = 


f'(x) 


{my 


In  this  theorem  we  of  course  suppose  that  f(x)  is  not  equal  to  zero  for 
the  particular  value  of  x  under  consideration.  Then 


l'(x)  =  Min  I  f  /W  ~  f(x  +  ,!) 


f(x) 


h  \  f(x  +  h)f(x) 


?{x) 


imr 


(5)  If  <p(x)  =  ,  then  <f>(x)  has  a  derivative 

r  (x) 


x  = 


f(x)F(x)  -  f(x)Ffx) 

{F(x)y 


This  follows  at  once  from  (3)  and  (4). 

(6)  If  4>{x)  =  F{f(x)},  then  <j)(x)  has  a  derivative 

<t>\x)  =  F'{f{x)}f\x). 

For  let 

f(x)=y,  f(x  +  h)  =  y  +  k. 
Then  k  — >  0  as  h  — >  0,  and  k/h  -A  f'(x).  And 


h 


=  lim 


F(y  +  k)-F(y)\ 

k  J 


x  lim  |  — 

h 


=  F\y)f\x). 


This  theorem  includes  (2)  and  (4)  as  special  cases,  as  we  see  on  taking 
F(x)  =  kx  or  F(x)  =  1/x.  Another  interesting  special  case  is  that  in  which 
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f(x)  =  ax  +  b:  the  theorem  then  shows  that  the  derivative  of  F(ax  +  b ) 
is  aF'(ax  +  b ). 

Our  last  theorem  requires  a  few  words  of  preliminary  explanation.  Sup¬ 
pose  that  x  =  if(y),  where  if(y)  is  continuous  and  steadily  increasing  (or 
decreasing),  in  the  stricter  sense  of  §  95,  in  a  certain  interval  of  values  of  y. 
Then  we  may  write  y  =  0(x),  where  0  is  the  ‘inverse’  function  (§  109)  of  0. 

(7)  If  y  =  4>(x),  where  0  is  the  inverse  function  of  if,  so  that  x  =  if(y), 
and  if(y)  has  a  derivative  if'(y)  which  is  not  equal  to  zero,  then  <f>(x)  has 
a  derivative 


For  if  (f(x  +  h)  —  y  +  k,  then  k  — >  0  as  h  — >  0,  and 

,//  N  r  (fix  +  h)  -  (j)(x)  _  (y  +  k)-y  _  1 

9('°  “ o  (x  +  h)-x  k™if(y  +  k)-if{y)  ip{yy 

The  last  function  may  now  be  expressed  in  terms  of  x  by  means  of  the 
relation  y  =  0(x),  so  that  <f>'(x)  is  the  reciprocal  of  0'{0(x)}.  This  theorem 
enables  us  to  differentiate  any  function  if  we  know  the  derivative  of  the 
inverse  function. 


114.  Derivatives  of  complex  functions.  So  far  we  have  supposed 
that  y  =  0(x)  is  a  purely  real  function  of  x.  If  y  is  a  complex  function 
0(x)  +  iip(x),  then  we  define  the  derivative  of  y  as  being  0'(x)  +  iif'{x). 
The  reader  will  have  no  difficulty  in  seeing  that  Theorems  (l)-(5)  above 
retain  their  validity  when  0(x)  is  complex.  Theorems  (6)  and  (7)  have  also 
analogues  for  complex  functions,  but  these  depend  upon  the  general  notion 
of  a  ‘function  of  a  complex  variable’,  a  notion  which  we  have  encountered 
at  present  only  in  a  few  particular  cases. 

115.  The  notation  of  the  differential  calculus.  We  have  already 
explained  that  what  we  call  a  derivative  is  often  called  a  differential  coeffi¬ 
cient.  Not  only  a  different  name  but  a  different  notation  is  often  used;  the 
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derivative  of  the  function  y  =  4>(x)  is  often  denoted  by  one  or  other  of  the 
expressions 

n  dy 

Dxy ,  ~r- 

ax 

Of  these  the  last  is  the  most  usual  and  convenient:  the  reader  must  however 
be  careful  to  remember  that  dy/dx  does  not  mean  ‘a  certain  number  dy 
divided  by  another  number  dxK.  it  means  ‘the  result  of  a  certain  opera¬ 
tion  Dx  or  d/dx  applied  to  y  —  (j>{x)\  the  operation  being  that  of  forming 
the  quotient  {4>(x  +  h)  —  </>(x)}/h  and  making  h  — y  0. 

Of  course  a  notation  at  first  sight  so  peculiar  would  not  have  been  adopted 
without  some  reason,  and  the  reason  was  as  follows.  The  denominator  h  of  the 
fraction  {(j){x+h)  —  (j>(x)}/h  is  the  difference  of  the  values  x+h,  x  of  the  indepen¬ 
dent  variable  x;  similarly  the  numerator  is  the  difference  of  the  corresponding 
values  <f>{x  +  h),  4>(x)  of  the  dependent  variable  y.  These  differences  may  be 
called  the  increments  of  x  and  y  respectively,  and  denoted  by  dx  and  5y.  Then 
the  fraction  is  5y/5x,  and  it  is  for  many  purposes  convenient  to  denote  the  limit 
of  the  fraction,  which  is  the  same  thing  as  4>'(x),  by  dy/dx.  But  this  notation 
must  for  the  present  be  regarded  as  purely  symbolical.  The  dy  and  dx  which 
occur  in  it  cannot  be  separated,  and  standing  by  themselves  they  would  mean 
nothing:  in  particular  dy  and  dx  do  not  mean  limdy  and  limtfx,  these  limits 
being  simply  equal  to  zero.  The  reader  will  have  to  become  familiar  with  this 
notation,  but  so  long  as  it  puzzles  him  he  will  be  wise  to  avoid  it  by  writing  the 
differential  coefficient  in  the  form  Dxy,  or  using  the  notation  c/>(x),  <j>'(x),  as  we 
have  done  in  the  preceding  sections  of  this  chapter. 

In  Ch.  VII,  however,  we  shall  show  how  it  is  possible  to  define  the  symbols 
dx  and  dy  in  such  a  way  that  they  have  an  independent  meaning  and  that  the 
derivative  dy/dx  is  actually  their  quotient. 

The  theorems  of  §  113  may  of  course  at  once  be  translated  into  this 
notation.  They  may  be  stated  as  follows: 

(1)  ify  =  Vi+V2,  then 

dy_  _  d/yi  +  4, 

dx  dx  dx  ’ 


dy  _  ,dyx 
dx  dx  ’ 


(2)  if  y  =  kyi,  then 
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(3)  ify 

(4)  ify 

(5)  ify 


ym,  then 


— ,  then 
Z/i 


— ,  then 
V2 


dy  dy2  ,  dy1 

~r  ~  2/i~ 7  i~  2/2  —1  ; 

dx  dx  dx 


dy  _  _  l_  dyi 
dx  y^  dx  ’ 


dy 

dx 


(6)  if  y  is  a  function  of  x,  and  z  a  function  of  y,  then 


(7) 


dz  dz  dy 
dx  dy  dx1 


dy 

dx 


Examples  XL.  1.  If  y  =  yiy2yd,  then 


dy 

dx 


dyi 

=  V2y3  -r-  +  y$y\ 

dx 


dy 2 

dx 


+  y  iV2 


dy 3 
dx  ’ 


and  if  y  =  y\y2  . . .  yn  then 


dy 

dx 


n 

2/12/2  • 

r= 1 


dyr 

■  ■  Vr—iyr+l  •  •  ■  Vn  , 

dx 


In  particular,  if  y  =  zn,  then  dy/dx  =  nzn  1  (dz/dx);  and  if  y 
dy/dx  =  nxn_1,  as  was  proved  otherwise  in  Ex.  xxxix.  3. 

2.  If  y  =  yiy2  . . .  yn  then 

ldy  =  JLdyi+J_dy2_^ _ |  1  dyn 

y  dx  yi  dx  y2  dx  yn  dx 

1  dy  n  dz 

In  particular,  it  y  =  z  ,  then  — —  = - — . 

y  dx  z  dx 


xn ,  then 
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116.  Standard  forms.  We  shall  now  investigate  more  systematically 
the  forms  of  the  derivatives  of  a  few  of  the  the  simplest  types  of  functions. 
A.  Polynomials.  If  <p(x)  =  a0xn  +  a±x n_1  +  •  •  •  +  an,  then 

4>'{x)  =  na0x n_1  +  (n  —  l)a\Xn~2  +  •  •  •  +  an_  i. 

It  is  sometimes  more  convenient  to  use  for  the  standard  form  of  a  polyno¬ 
mial  of  degree  n  in  x  what  is  known  as  the  binomial  form,  viz. 

a0xn  +  nix71-1  +  a-2Xn~2  H - f  an. 

In  this  case 

4>\x)  =  n  |a0a:n_1  +  2  1^a2a;n“3  H - f  an_i  j  . 

The  binomial  form  of  <f>{x)  is  often  written  symbolically  as 

(oo,  •  •  •  j  i 

and  then 

4>'(x)  =  n(a0,  ai, ,  an_ i  jj  x,  l)”-1. 

We  shall  see  later  that  (f>(x)  can  always  be  expressed  as  the  product  of 
n  factors  in  the  form 

4>(x)  =  a0(x  -  af){x  -  a2)  ■  ■  ■  (x  —  an), 

where  the  a’s  are  real  or  complex  numbers.  Then 

(f\x)  =  a0J2(x  -  a2)(x  -a3)...(x-  an), 

the  notation  implying  that  we  form  all  possible  products  of  n  —  1  factors, 
and  add  them  all  together.  This  form  of  the  result  holds  even  if  several 
of  the  numbers  a  are  equal;  but  of  course  then  some  of  the  terms  on  the 
right-hand  side  are  repeated.  The  reader  will  easily  verify  that  if 

4>(x)  =  a0(x  —  ai)mi(x  —  a2)m2  ■  ■  ■  (x  —  au)m'', 

then 

4>'{x)  =  aoJ2‘mi(x  —  ai)mi_1(a;  —  a2)ni2 . . .  (x  —  aL,)mv . 
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Examples  XLI.  1.  Show  that  if  <j)(x)  is  a  polynomial  then  (f>'{x)  is  the 
coefficient  of  h  in  the  expansion  of  <f>(x  +  h)  in  powers  of  h. 

2.  If  </>(x)  is  divisible  by  (x  —  a)2,  then  <f>'{x)  is  divisible  by  x  —  a:  and 
generally,  if  (f>{x)  is  divisible  by  (x  —  a)m,  then  4>'{x)  is  divisible  by  (x  —  a)m_1. 

3.  Conversely,  if  4>(x)  and  4>'(x)  are  both  divisible  by  x  —  a,  then  4>(x)  is 
divisible  by  (x  —  a)2;  and  if  4>{x)  is  divisible  by  x  —  a  and  (f/(x)  by  (x  —  a)m_1, 
then  4>(x)  is  divisible  by  (x  —  a)m. 

4.  Show  how  to  determine  as  completely  as  possible  the  multiple  roots  of 
P{x)  =  0,  where  P(x)  is  a  polynomial,  with  their  degrees  of  multiplicity,  by 
means  of  the  elementary  algebraical  operations. 

[If  H±  is  the  highest  common  factor  of  P  and  P' ,  H2  the  highest  common 
factor  of  H\  and  P" ,  H%  that  of  H-2  and  P'" ,  and  so  on,  then  the  roots  of 
H\  H3/H2  =  0  are  the  double  roots  of  P  =  0,  the  roots  of  H2H4/H2  =  0  the 
treble  roots,  and  so  on.  But  it  may  not  be  possible  to  complete  the  solution  of 
H1H3/H2  =  0,  H2H4/H2  =  0,  ....  Thus  if  P(x)  =  (x  -  l)3(x5  -  x  -  7)2  then 
H1H3/H2  =  x5  —  x  —  7  and  H2H4/H2  =  x  —  1;  and  we  cannot  solve  the  first 
equation.] 

5.  Find  all  the  roots,  with  their  degrees  of  multiplicity,  of 

x4  +  3x3  —  3x2  —  llx  —  6  =  0,  x6  +  2x5  —  8x4  —  14x3  +  llx2  +  28x  +  12  =  0. 

6.  If  ax2  +  2 bx  +  c  has  a  double  root,  i.e.  is  of  the  form  a(x  —  a)2,  then 
2 (ax  +  b)  must  be  divisible  by  x  —  a,  so  that  a  =  —b/a.  This  value  of  x  must 
satisfy  ax2  +  26x  +  c  =  0.  Verify  that  the  condition  thus  arrived  at  is  ac—b2  =  0. 

7.  The  equation  1/  (x  —  a)  +  1  /  (x  —  b)  +  1/  (x  —  c)  =  0  can  have  a  pair  of 

equal  roots  only  if  a  =  b  =  c.  (Math.  Trip.  1905.) 

8.  Show  that 

ax3  +  3  bx2  +  3cx  +  d  =  0 

has  a  double  root  if  G2  +  4 H3  =  0,  where  H  =  ac  —  b2,  G  =  a2 d  —  3 abc  +  2b 3. 

[Put  ax  +  b  =  y,  when  the  equation  reduces  to  y 3  +  3 Hy  +  G  =  0.  This  must 
have  a  root  in  common  with  y2  +  H  =  0.] 

9.  The  reader  may  verify  that  if  a,  /3,  7,  5  are  the  roots  of 

ax 4  +  46x3  +  6cx2  +  4  dx  +  e  =  0, 
then  the  equation  whose  roots  are 


Va{(a  -  /3)( 7  -6)-( 7  -  a)(/3  -  <5)} 
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and  two  similar  expressions  formed  by  permuting  a,  j3,  7  cyclically,  is 

46>3  -  g20  -  g3  =  0, 

where 

g2  =  ae  —  4bd  +  3c2 ,  53  =  ace  +  2  bed  —  ad 2  —  eb2  —  c3 . 

It  is  clear  that  if  two  of  a,  (3,  7,  5  are  equal  then  two  of  the  roots  of  this  cubic 
will  be  equal.  Using  the  result  of  Ex.  8  we  deduce  that  g\  —  27 g\  =  0. 

10.  Rolle’s  Theorem  for  polynomials.  If  <f>{x)  is  any  polynomial,  then 
between  any  pair  of  roots  of  4>{x)  =  0  lies  a  root  of  fix)  =  0. 

A  general  proof  of  this  theorem,  applying  not  only  to  polynomials  but  to 
other  classes  of  functions,  will  be  given  later.  The  following  is  an  algebraical 
proof  valid  for  polynomials  only.  We  suppose  that  a,  (3  are  two  successive  roots, 
repeated  respectively  m  and  n  times,  so  that 

<f>{x)  =  (x  —  a)m(x  —  (3)n9{x), 

where  d(x)  is  a  polynomial  which  has  the  same  sign,  say  the  positive  sign,  for 
a  ^  x  ^  (3.  Then 

f{x)  =  (x-  a)m(x  -  P)n0'(x)  +  {mix  -  a^fx  -  /3)n  +  n{x  -  a)m{x  - 

=  (x  —  a)m~1{x  —  f3)n~1[{x  —  a){x  —  f3)9'{x)  +  {m{x  —  0)  +  n(x  —  a)}9{x)\ 

=  {x-a)m-l{x- PY^Fix), 

say.  Now  F(a )  =  m{a  —  f3)6{a)  and  F(f3)  =  n{[3  —  a)6{/3),  which  have  opposite 
signs.  Hence  Fix),  and  so  f{x),  vanishes  for  some  value  of  x  between  a  and  /3. 


117.  B.  Rational  Functions.  If 


R{x) 


P(x) 
Qix)  ’ 


where  P  and  Q  are  polynomials,  it  follows  at  once  from  §  113,  (5)  that 


p//  x  _  P'(x)Q{x)  -  P{x)Q'(x) 

{Q(x)}2 

and  this  formula  enables  us  to  write  down  the  derivative  of  any  rational 
function.  The  form  in  which  we  obtain  it,  however,  may  or  may  not  be  the 
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simplest  possible.  It  will  be  the  simplest  possible  if  Q(x)  and  Q'(x)  have 
no  common  factor,  i.e.  if  Q(x)  has  no  repeated  factor.  But  if  Q(x)  has 
a  repeated  factor  then  the  expression  which  we  obtain  for  R'(x)  will  be 
capable  of  further  reduction. 

It  is  very  often  convenient,  in  differentiating  a  rational  function,  to 
employ  the  method  of  partial  fractions.  We  shall  suppose  that  Q(x),  as  in 
§  116,  is  expressed  in  the  form 


a  0(x  -  ai)mi(x  -  a2)m2  ...(x-  au)mv . 


Then  it  is  proved  in  treatises  on  Algebra*  that  R(x)  can  be  expressed  in 
the  form 


m  ,  ^1,2 

II(x)  H - ! - h 


+ 


x  —  a  i  (x  —  ai)‘ 

A2,i  A 


+  •••  + 


A 


l,mi 


+ 


2,2 


x  —  a2  (x  —  a2)2 


(x  -  Oii)mi 

+  ■■■+,  +  ..., 


(x  —  a2)mz 


where  II (x)  is  a  polynomial;  i.e.  as  the  sum  of  a  polynomial  and  the  sum 
of  a  number  of  terms  of  the  type 

A 

(. x  —  ot)pl 


where  a  is  a  root  of  Q(x)  =  0.  We  know  already  how  to  find  the  derivative 
of  the  polynomial:  and  it  follows  at  once  from  Theorem  (4)  of  §  113,  or,  if 
a  is  complex,  from  its  extension  indicated  in  §  114,  that  the  derivative  of 
the  rational  function  last  written  is 

pA(x  —  a)p_1  pA 

(x  -  a)2p  ~  (x  -  a)p+1 ' 


We  are  now  able  to  write  down  the  derivative  of  the  general  rational 
function  R(x),  in  the  form 


IT  (a;) 


Ail 

(x  —  ai)2 


2A1j2 
(x  —  Qfi)3 


A  2,1  _  2A22 

(x  -  a2)2  (x  -  a2)3 


See,  e.g.,  Chrystal’s  Algebra ,  vol.  i,  pp.  151  et  seq. 
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Incidentally  we  have  proved  that  the  derivative  of  xm  is  mi™'1,  for  all 
integral  values  of  m  positive  or  negative. 

The  method  explained  in  this  section  is  particularly  useful  when  we 
have  to  differentiate  a  rational  function  several  times  (see  Exs.  xlv). 

Examples  XLII.  1.  Prove  that 

d  f  x  \  1  —  x2  d  f  1  —  x2\  4x 

dx  \  1  +  x2)  (1  +  x2)2’  dx  \  1  +  x2 )  (1+x2)2' 

2.  Prove  that 

d  (  ax2  +  2bx  +  c  \  (ax  +  b)(Bx  +  C)  —  (bx  +  c){Ax  +  B) 

dx  \  Ax2  +  2 Bx  +  C )  (Ax2  +  2 Bx  +  C )2 

3.  If  Q  has  a  factor  (x  —  a)m  then  the  denominator  of  R'  (when  R'  is 
reduced  to  its  lowest  terms)  is  divisible  by  (x  —  a)m+l  but  by  no  higher  power 
of  x  —  a. 

4.  In  no  case  can  the  denominator  of  R'  have  a  simple  factor  x  —  a.  Hence 
no  rational  function  (such  as  1  / x)  whose  denominator  contains  any  simple  factor 
can  be  the  derivative  of  another  rational  function. 


118.  C.  Algebraical  Functions.  The  results  of  the  preceding  sec¬ 
tions,  together  with  Theorem  (6)  of  §  113,  enable  us  to  obtain  the  derivative 
of  any  explicit  algebraical  function  whatsoever. 

The  most  important  such  function  is  x m,  where  m  is  a  rational  number. 
We  have  seen  already  (§  117)  that  the  derivative  of  this  function  is  mi™"1 
when  m  is  an  integer  positive  or  negative;  and  we  shall  now  prove  that  this 
result  is  true  for  all  rational  values  of  m.  Suppose  that  y  =  xm  =  xp^q, 
where  p  and  q  are  integers  and  q  positive;  and  let  z  =  x 1/,g,  so  that  x  =  zq 
and  y  =  zp .  Then 

^=(dj\/(dx\=lzV-q  =  mxm-l 

dx  \dz  J  /  \dz )  q 
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This  result  may  also  be  deduced  as  a  corollary  from  Ex.  xxxvi.  3.  For, 
if  (p(x)  =  xm,  we  have 


(x 


=  lim 

/i->  o 

=  lim 

£— >x 


(x  +  h)r 


x 


x 


h 

m 


£-x 


=  mx 


m—  1 


It  is  clear  that  the  more  general  formula 

-^-(ax  +  b)m  =  maiax  +  b)m~l 
ax 

holds  also  for  all  rational  values  of  m. 

The  differentiation  of  implicit  algebraical  functions  involves  certain  the¬ 
oretical  difficulties  to  which  we  shall  return  in  Ch.  VII.  But  there  is  no 
practical  difficulty  in  the  actual  calculation  of  the  derivative  of  such  a 
function:  the  method  to  be  adopted  will  be  illustrated  sufficiently  by  an 
example.  Suppose  that  y  is  given  by  the  equation 


x3  +  y3  —  3  axy  =  0. 


Differentiating  with  respect  to  x  we  find 


and  so 


2  ,  2  dy 

x  +y  — 
ax 


(y  +  x 


dy  x2  —  ay 

dx  y2  —  ax 


0 


Examples  XLIII.  1. 

/  ax  +  b 


Find  the  derivatives  of 


1  +  x 
1  —  x' 


ax 2  +  2  bx  +  c 


cx  +  d  ’  V  Ax 2  +  2  Bx  +  C  ’ 


(ax  +  b)m(cx  +  dy 


2.  Prove  that 


d  1 

r  *  i 

I  a2  d  I 

r  x  1 

L_ 

dx  } 

[  \J a2  +  x2  J 

(a2  +  x2)!3/2)  ’  dx  I 

[  Va2  —  x2  J 

1  (a2 

—  x2)3/2 
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3.  Find  the  differential  coefficient  of  y  when 

(i)  ax 2  +  2  hxy  +  by 2  +  2  gx  +  2  fy  +  c  =  0,  (ii)  x5  +  y5  —  5  ax2y2  =  0. 

119.  D.  Transcendental  Functions.  We  have  already  proved 
(Ex.  xxxix.  4)  that 

Dx  sin  a;  =  cos  a;,  Dx  cosx  =  —  sin  a;. 

By  means  of  Theorems  (4)  and  (5)  of  §  113,  the  reader  will  easily  verify 
that 

Dx  tan  x  =  sec2  x ,  Dx  cot  x  =  —  cosec2  x, 

Dx  sec  x  =  tan  x  sec  x,  Dx  cosec  x  =  —  cot  x  cosec  x. 

And  by  means  of  Theorem  (7)  we  can  determine  the  derivatives  of  the 
ordinary  inverse  trigonometrical  functions.  The  reader  should  verify  the 
following  formulae: 

Dx  arc  sin  x  =  ±1/ \/l  —  x2,  Dx  arc  cos  x  —  ^1/ \J  1  —  x2, 

Dx  arc  tan  x  =  1/(1  +  x2),  Dx  arccota;  =  —1/(1  +  x2), 

Dx  arc  sec  x  =  ±l/{x\/x 2  —  1},  Dx  arc  cosec  a:  =  ^l/{x\/x2  —  1}. 

In  the  case  of  the  inverse  sine  and  cosecant  the  ambiguous  sign  is  the  same 
as  that  of  cos  (arc  sin  x),  in  the  case  of  the  inverse  cosine  and  secant  the 
same  as  that  of  sin ( arc  cos  x). 

The  more  general  formulae 

Dx  arcsin(a;/a)  =  ±1  / V a2  —  x2,  Dx  arctan(x/a)  =  a/(x2  +  a2), 

which  are  also  easily  derived  from  Theorem  (7)  of  §  113,  are  also  of  consid¬ 
erable  importance.  In  the  first  of  them  the  ambiguous  sign  is  the  same  as 
that  of  a  cos {  arc  sin (x/a) } ,  since 

ay/ 1  —  ( x2  ja 2)  =  ±V  a2  —  x2 
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according  as  a  is  positive  or  negative. 

Finally,  by  means  of  Theorem  (6)  of  §  113,  we  are  enabled  to  differenti¬ 
ate  composite  functions  involving  symbols  both  of  algebraical  and  trigono¬ 
metrical  functionality,  and  so  to  write  down  the  derivative  of  any  such 
function  as  occurs  in  the  following  examples. 

Examples  XLIV.*  1.  Find  the  derivatives  of 

cos  x ,  sm  x,  cos  x  ,  sm  x  ,  cos  (sin®),  sin(cos2j. 

ry - 7. - .  9 —  cos  x  sin x 

V  a  cos^  x  +  cr  sm  2,  —  — , 

V  a 2  cos2  2  T  b2  sin2  x 

x  arc  sin  x  +  v/ 1  —  22,  ( 1  +  2)  arc  tan  yfx  —  yfx. 


2.  Verify  by  differentiation  that  arc  sin  2  + arc  cos  2  is  constant  for  all  values 
of  2  between  0  and  1,  and  arc  tan  2  +  arc  cot  2  for  all  positive  values  of  2. 

3.  Find  the  derivatives  of 


arc  sm 


\/l  —  22,  arcsin{22  \/l  —  22},  arc  tan  (  —  )  . 

\1  —  ax  J 


How  do  you  explain  the  simplicity  of  the  results? 

4.  Differentiate 


1 


y/ac—b2 


arc  tan 


ax  +  b 
y/ac—b2' 


1  ax  +  b 

- —  arc  sm 
'  —a 


\Jb 2  —  ac 


5.  Show  that  each  of  the  functions 

2y/(a  —  2)  (2  —  /3) 
a  —  (5 

has  the  derivative 

1 

y/(a-  2)  (2  -  /?) 

*In  these  examples  m  is  a  rational  number  and  a,  b,  . . . ,  a,  ) 3  .. .  have  such  values 
that  the  functions  which  involve  them  are  real. 


2  arc  sin 


12-/5 
a  —  /?’ 


2  arc  tan 


2  —  (5 


a  —  2 


arc  sm 
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6.  Prove  that 


<d_ 

dd 


cos  8  cos  38 


7.  Show  that 


(Math.  Trip.  1904.) 


1  _d_ 

y/ C(Ac  -  aC )  dx 


arc  cos 


'  C(ax2  +  c ) 
c(Ax 2  +  C) 


1 

(Ax2  +  C)  Vaa:2  +  c 


8.  Each  of  the  functions 


V  a,2  —  b2 


arc  cos 


a  cos  x  +  b 


a  +  b  cos  x  y  ’  Va2  -  62 


arc  tan 


I  a  —  b 
a  +  b 


tan  lx 


has  the  derivative  1  / (o  +  6  cos  x) . 

9.  If  V  =  a  +  b  cos  x  +  c  sin  x,  and 


1 


V  = 


\J  a2  —  b2  —  c2 


arc  cos 


aX  -a2 +  b2  +  c2 
Xs/b2  +  c2  : 


then  dy/dx  =  1/X. 

10.  Prove  that  the  derivative  of  F[/{</>(x)}]  is  F'[f{(f>(x)}\  f  {4>(x)}(t>' (x), 
and  extend  the  result  to  still  more  complicated  cases. 

11.  If  u  and  v  are  functions  of  x,  then 


Dx  arc  tan (u/v)  =  (vDxu  —  uDxv)/(u2  +  v2). 


12.  The  derivative  of  y  =  (tanx  +  secx)m  is  my  sec  x. 

13.  The  derivative  of  y  =  cos  x  +  i  sin  x  is  iy. 

14.  Differentiate  xcosx,  (sinx)/x.  Show  that  the  values  of  x  for  which  the 
tangents  to  the  curves  y  =  xcosx,  y  =  (sinx)/x  are  parallel  to  the  axis  of  x  are 
roots  of  cotx  =  x,  tanx  =  x  respectively. 

15.  It  is  easy  to  see  (cf.  Ex.  XVII.  5)  that  the  equation  sinx  =  ax,  where  a  is 
positive,  has  no  real  roots  except  x  =  0  if  a  ^  1,  and  if  a  <  1  a  finite  number  of 
roots  which  increases  as  a  diminishes.  Prove  that  the  values  of  a  for  which  the 
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number  of  roots  changes  are  the  values  of  cos£,  where  £  is  a  positive  root  of  the 
equation  tan£  =  £.  [The  values  required  are  the  values  of  a  for  which  y  =  ax 
touches  y  =  sin  x.] 

16.  If  0(x)  =  x2  sin(l/x)  when  x  ^  0,  and  0(0)  =  0,  then 
0' (x)  =  2.xsin(l/x)  —  cos(l/x) 


when  x  /  0,  and  0' (0)  =  0.  And  0'(x )  is  discontinuous  for  x  =  0  (cf.  §  111,  (2)). 

17.  Find  the  equations  of  the  tangent  and  normal  at  the  point  (xo,yo)  of 
the  circle  x2  +  y2  =  a2. 

[Here  y  =  V a2  —  x2,  dy/dx  =  —x/ffa2  —  x 2,  and  the  tangent  is 
U  ~  Vo  =  0  -  x0)  j-z0/ \ja2  -  xgj  , 

which  may  be  reduced  to  the  form  xxq  +  yyo  =  a2.  The  normal  is  xyo  —  yx o  =  0, 
which  of  course  passes  through  the  origin.] 

18.  Find  the  equations  of  the  tangent  and  normal  at  any  point  of  the  ellipse 
(x/a)2  +  ( y/b )2  =  1  and  the  hyperbola  ( x/a )2  —  ( y/b )2  =  1. 

19.  The  equations  of  the  tangent  and  normal  to  the  curve  x  =  0(t ),  y  =  0(t), 
at  the  point  whose  parameter  is  t,  are 


x  —  0(t) 
0'(t) 


y-0(t) 

0'(t) 


{ x 


0(t)}0'(t)  +  {y-  0(t)}0'(t)  =  0. 


120.  Repeated  differentiation.  We  may  form  a  new  function  0"(x) 
from  0'(x )  just  as  we  formed  0'(x )  from  0(x).  This  function  is  called  the 
second  derivative  or  second  differential  coefficient  of  0(x).  The  second 
derivative  of  y  =  0(x)  may  also  be  written  in  any  of  the  forms 


/  d  \ 2  d2y 
dx 2’ 


In  exactly  the  same  way  we  may  define  the  nth  derivative  or  nth  dif¬ 
ferential  coefficient  of  y  =  0(x),  which  may  be  written  in  any  of  the  forms 


dny 

dxn' 


D£y, 


d 

dx 


y, 
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But  it  is  only  in  a  few  cases  that  it  is  easy  to  write  down  a  general  formula 
for  the  nth  differential  coefficient  of  a  given  function.  Some  of  these  cases 
will  be  found  in  the  examples  which  follow. 

Examples  XLV.  1.  If  cj)(x)  =  xm  then 

(fin\x)  =  m(m  —  1) . . .  (m  —  n  +  l)xm~n. 

This  result  enables  us  to  write  down  the  nth  derivative  of  any  polynomial. 

2.  If  4>(x)  =  (ax  +  b)m  then 

^n\x)  =  m(m  —  1 ) . . .  (m  —  n  +  l)an(ax  +  b)m~n. 

In  these  two  examples  m  may  have  any  rational  value.  If  m  is  a  positive  integer, 
and  n  >  m,  then  q (x)  =  0. 

3.  The  formula 

( AV  A  =  i  1)nP(P+1)  + 

\dx )  (x  —  a)P  {  ’  ( x-a)P+n 

enables  us  to  write  down  the  nth  derivative  of  any  rational  function  expressed 
in  the  standard  form  as  a  sum  of  partial  fractions. 

4.  Prove  that  the  nth  derivative  of  1/(1  —  x2)  is 

^HKa-xr^  +  t-ini+x)-"-1}. 

5.  Leibniz’  Theorem.  If  y  is  a  product  uv,  and  we  can  form  the  first 
n  derivatives  of  u  and  v,  then  we  can  form  the  nth  derivative  of  y  by  means  of 
Leibniz’  Theorem,  which  gives  the  rule 

(uv)n  =  UnV  +  Un-lVi  +  (^Jun-2V 2  H - h  H - f  UVn, 

where  suffixes  indicate  differentiations,  so  that  un,  for  example,  denotes  the 
nth  derivative  of  u.  To  prove  the  theorem  we  observe  that 

(uv)  1  =  U\V  +  uv  1, 

(uv)  2  =  u2v  +  2u\v\  +  uv  2, 
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and  so  on.  It  is  obvious  that  by  repeating  this  process  we  arrive  at  a  formula  of 
the  type 


( uv)n  —  UnV  +  an,\Un—\V\  +  fln,2Wn- 2^2  +  '  '  '  +  <ln,rUn-rVr  +  '  '  '  +  UVn. 


Tl 

Let  us  assume  that  anr  =  (  )  for  r  =  1,  2,  . . . ,  n  —  1,  and  show  that  if 

V  r 


this  is  so  then  an+ 1  ,r  = 


n  +  1 
r 


for  r  =  1,2,...  n.  It  will  then  follow  by  the 


n 


principle  of  mathematical  induction  that  an,r  =  ^  J  for  all  values  of  n  and  r 
in  question. 

When  we  form  (uv)n+ 1  by  differentiating  (uv)n  it  is  clear  that  the  coefficient 
of  un+ \-rvr  is 


^n,r  T  0"n,i — 1  — 


n 


+ 


n 

r  —  1 


n  +  1 
r 


This  establishes  the  theorem. 

6.  The  nth  derivative  of  xmf(x)  is 


ml  m—n  \  ,  m- 

-x  f(x)  +  n 


m—n+ 1  i*/ 


(?n  —  n) ! 


+ 


(m  —  n  +  1)! 
n(n  —  1) 


x 


m! 


/  (*) 


n+2  t// 


1-2  (m.  —  n  +  2)!' 


f"(x)  + 


the  series  being  continued  for  n  +  1  terms  or  until  it  terminates. 

7.  Prove  that  D™  cosx  =  cos(x  +  ^nir),  D™  sinx  =  sin(x  +  ^n7r). 

8.  If  y  =  dcosrox  +  B  sin  mx  then  D^y  +  m2y  =  0.  And  if 


y  =  A  cos  mx  +  B  sin  mx  +  Pn(x ), 

where  Pn(x)  is  a  polynomial  of  degree  n,  then  +  m2  Dx+1y  =  0. 

9.  If  x2D2y  +  xDxy  +  y  =  0  then 

x2Dn+2y  +  (2  n  +  1  )xD^+1y  +  (n2  +  =  0. 


[Differentiate  n  times  by  Leibniz’  Theorem.] 
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10.  If  Un  denotes  the  nth  derivative  of  (Lx  +  M)/(x 2  —  2 Bx  +  C ),  then 


x2  —  2  Bx  +  C 
(n  +  1  )(n  +  2) 


Un+2  + 


2(x  —  B) 
n  +  1 


Un+ i  +  Un  —  0. 


(Math.  Trip.  1900.) 

[First  obtain  the  equation  when  n  =  0;  then  differentiate  n  times  by  Leibniz’ 
Theorem.] 

11.  The  nth  derivatives  of  a/ (a2  +  x2)  and  x/(a 2  +  x2).  Since 


a 

_I| 

f  1 

_  1  ^ 

X 

_  w 

a2  +  x2 

2  i  ' 

lx  —  ai 

x  +  ai  J 

a 2  +  x2 

2  l 

1 


1 


V  + 

x  —  ai  x  +  ai 


we  have 


Dn 

X 


az  +  xz 


(— l)nn!  f  1  1  | 

2 i  {  (x  -  ai)n+1  ~  (x  +  ai)n+1  J  ’ 


and  a  similar  formula  for  D™{x/(a2  +  x2)}.  If  p=  \/ x2  +  a2,  and  0  is  the 
numerically  smallest  angle  whose  cosine  and  sine  are  x/p  and  a/p,  then 
x  +  ai  =  p  Cis  6  and  x  —  ai  =  p  Cis(— 9) ,  and  so 

Dnx{a/(a 2  +  x2)}  =  {(— l)nn!/2i}/c>-n-1[Cis{(n  +  1)0}  -  Cis{-(n  +  1)0}] 

=  (— l)nn!  (x2  +  a2)“(n+1^2  sin{(n  +  1)  arctan(a/x)}. 


Similarly 

Dx{x/(a 2  +  x2)}  =  (— 1  )nn\  (x2  +  a2)^(n+1^2  cos{(n  +  1)  arctan(a/x)}. 


12.  Prove  that 


Dx{( cosx)/x}  =  {Pn  cos(x  +  \mt)  +  Qn  sin(x  +  ^n7r)}/xn+1, 
Dx{( sinx)/x}  =  {Pn  sin(x  +  ^rnr)  —  Qn  cos(x  +  ^n7r)}/xn+1, 

where  Pn  and  Qn  are  polynomials  in  x  of  degree  n  and  n  —  1  respectively. 
13.  Establish  the  formulae 

dx  _  /  f  dy\  d2x  _  d2y  /  /  dy\3 

dy  /  \  dx  J  '  dy2  dx2  /  \  dx )  1 
d3x  =  _fd37/dy_  /d2y\'l  / f dy\ 
dy 3  \  dx3  dx  \ dx2 )  J  /  \dx  J 


5 
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14.  If  yz  =  1  and  yr  =  (1  /r\)Drxy,  zs  =  (1  /s\)Dsxz,  then 


Z 1 

Z2 

1 

V2 

V3 

Z\ 

Z2 

Z3 

o 

2T 

V3 

Vi 

Z2 

Z3 

Z4 

(Math.  Trip.  1905.) 

15.  If 

y  z  u 

IV  (y,  z ,  u)  =  y'  z'  v!  , 

y"  V'  u" 

dashes  denoting  differentiations  with  respect  to  x,  then 

W(y,z,u)  =  y3w(l,-,-). 

V  y  yJ 

16.  If 

ax 2  +  2  hxy  +  by2  +  2gx  +  2  fy  +  c  =  0, 

then 

dy/dx  =  -(ax  +  hy  +  g)/ (hx  +  by  +  /) 

and 

d2y/dx 2  =  (abc  +  2/<?/i  —  af2  —  bg2  —  ch2)/(hx  +  by  +  /)3. 


121.  Some  general  theorems  concerning  derived  functions. 

In  all  that  follows  we  suppose  that  <fi(x)  is  a  function  of  x  which  has  a 
derivative  (j)'(x )  for  all  values  of  x  in  question.  This  assumption  of  course 
involves  the  continuity  of  <p(x ). 

The  meaning  of  the  sign  of  4>'(x).  Theorem  A.  If  cp'(xo)  >  0 
then  4>(x )  <  4>(xq )  for  all  values  of  x  less  than  xq  but  sufficiently  near  to  xq, 
and  <f>(x)  >  4>(xq )  for  all  values  of  x  greater  than  xq  but  sufficiently  near 
to  x0. 

For  {c f>(x0  +  h )  —  f>(x0)}/h  converges  to  a  positive  limit  4>'(x0 )  as  h  — >  0. 
This  can  only  be  the  case  if  f>(xo  +  h)  —  f>(xo)  and  h  have  the  same  sign 
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for  sufficiently  small  values  of  h,  and  this  is  precisely  what  the  theorem 
states.  Of  course  from  a  geometrical  point  of  view  the  result  is  intuitive, 
the  inequality  <f'{x)  >  0  expressing  the  fact  that  the  tangent  to  the  curve 
y  =  4>(x)  makes  a  positive  acute  angle  with  the  axis  of  x.  The  reader 
should  formulate  for  himself  the  corresponding  theorem  for  the  case  in 
which  <f\x)  <  0. 

An  immediate  deduction  from  Theorem  A  is  the  following  important 
theorem,  generally  known  as  Rollc’s  Theorem.  In  view  of  the  great  impor¬ 
tance  of  this  theorem  it  may  be  well  to  repeat  that  its  truth  depends  on 
the  assumption  of  the  existence  of  the  derivative  <f'(x)  for  all  values  of  x 
in  question. 

Theorem  B.  //0(a)  =  0  and  0(6)  =  0,  then  there  must  be  at  least 
one  value  of  x  which  lies  between  a  and  b  and  for  which  <f'(x)  =  0. 

There  are  two  possibilities:  the  first  is  that  <f>{x)  is  equal  to  zero 
throughout  the  whole  interval  [a,b].  In  this  case  <f\x)  is  also  equal  to 
zero  throughout  the  interval.  If  on  the  other  hand  (f>(x)  is  not  always  equal 
to  zero,  then  there  must  be  values  of  x  for  which  0(x)  is  positive  or  nega¬ 
tive.  Let  us  suppose,  for  example,  that  <f>{x)  is  sometimes  positive.  Then, 
by  Theorem  2  of  §  102,  there  is  a  value  £  of  x,  not  equal  to  a  or  b,  and  such 
that  0(£)  is  at  least  as  great  as  the  value  of  0(x)  at  any  other  point  in  the 
interval.  And  0'(£)  must  be  equal  to  zero.  For  if  it  were  positive  then  <p(x) 
would,  by  Theorem  A,  be  greater  than  0(£)  for  values  of  x  greater  than  £ 
but  sufficiently  near  to  £,  so  that  there  would  certainly  be  values  of  <p(x) 
greater  than  0(£).  Similarly  we  can  show  that  0'(£)  cannot  be  negative. 

Cor  1.  //0(a)  =  0(6)  =  k,  then  there  must  be  a  value  of  x  between 
a  and  b  such  that  cj>'{x)  =  0. 

We  have  only  to  put  (f>{x)  —  k  =  ijj(x)  and  apply  Theorem  B  to  if(x). 

Cor  2.  If  4>'(x)  >  0  for  all  values  of  x  in  a  certain  interval,  then  (f>{x)  is 
an  increasing  function  of  x,  in  the  stricter  sense  of  §  95,  throughout  that 
interval. 

Let  X\  and  X2  be  two  values  of  x  in  the  interval  in  question,  and  x\  <  a;2. 
We  have  to  show  that  <f>{x i)  <  0(x2).  In  the  first  place  0(aq)  cannot  be 
equal  to  0(x2);  for,  if  this  were  so,  there  would,  by  Theorem  B,  be  a  value 
of  x  between  x\  and  x2  for  which  <f>\x)  =  0.  Nor  can  0(aq)  be  greater 
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than  0(x2).  For,  since  0'(xi)  is  positive,  (j>(x)  is,  by  Theorem  A,  greater 
than  cj)(x i)  when  x  is  greater  than  x\  and  sufficiently  near  to  x\.  It  follows 
that  there  is  a  value  £3  of  x  between  x\  and  £2  such  that  4>{xz)  =  0(£i); 
and  so,  by  Theorem  B,  that  there  is  a  value  of  x  between  x\  and  £3  for 
which  4>'(x)  =  0. 

Cor  3.  The  conclusion  of  Cor.  2  still  holds  if  the  interval  [ a,b ]  consid¬ 
ered  includes  a  finite  number  of  exceptional  values  of  x  for  which  <f'(x)  does 
not  exist,  or  is  not  positive,  provided  <f(x)  is  continuous  even  for  these  ex¬ 
ceptional  values  of  x. 

It  is  plainly  sufficient  to  consider  the  case  in  which  there  is  one  excep¬ 
tional  value  of  x  only,  and  that  corresponding  to  an  end  of  the  interval, 
say  to  a.  If  a  <  x\  <  x2  <  b,  we  can  choose  a  +  5  so  that  a  +  S  <  x  1,  and 
<j)'{x)  >  0  throughout  [a  +  5,b],  so  that  <f{x  1)  <  0(£2),  by  Cor.  2.  All  that 
remains  is  to  prove  that  0(a )  <  <f{xf).  Now  <f(x  1)  decreases  steadily,  and 
in  the  stricter  sense,  as  x\  decreases  towards  a,  and  so 

0(a)  =  0(a  +  0)  =  lim  <f(xf)  <  <f{x  1). 

mi— >-a+0 

Cor  4.  If  <f>'(x)  >  0  throughout  the  interval  [a,b],  and  0(a)  A  0,  then 
0(x)  is  positive  throughout  the  interval  [a,  b] . 

The  reader  should  compare  the  second  of  these  corollaries  very  carefully 
with  Theorem  A.  If,  as  in  Theorem  A,  we  assume  only  that  0,(x)  is  positive  at 
a  single  point  x  =  xo,  then  we  can  prove  that  0(xi)  <  0(a;2)  when  x\  and  x2  are 
sufficiently  near  to  £0  and  x\  <  x$  <  x2.  For  0(xi)  <  0(x’o)  and  0(x2)  >  0(xoh 
by  Theorem  A.  But  this  does  not  prove  that  there  is  any  interval  including  xq 
throughout  which  0(x)  is  a  steadily  increasing  function,  for  the  assumption  that 
x\  and  X2  lie  on  opposite  sides  of  xq  is  essential  to  our  conclusion.  We  shall 
return  to  this  point,  and  illustrate  it  by  an  actual  example,  in  a  moment  (§  124). 


122.  Maxima  and  Minima.  We  shall  say  that  the  value  0(£)  as¬ 
sumed  by  0(x)  when  x  =  £  is  a  maximum  if  0(£)  is  greater  than  any  other 
value  assumed  by  0(x)  in  the  immediate  neighbourhood  of  x  —  £,  i.e.  if 
we  can  find  an  interval  [£  —  S,  £  +  5]  of  values  of  x  such  that  0(£)  >  <f(x) 
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when  £  —  5  <  x  <  £  and  when  £  <  x  <  £  +  5;  and  we  define  a  minimum  in 
a  similar  manner.  Thus  in  the  figure  the  points  A  correspond  to  maxima, 
the  points  B  to  minima  of  the  function  whose  graph  is  there  shown.  It  is 


Fig.  39. 

to  be  observed  that  the  fact  that  A3  corresponds  to  a  maximum  and  B\  to 
a  minimum  is  in  no  way  inconsistent  with  the  fact  that  the  value  of  the 
function  is  greater  at  B\  than  at  A3. 

Theorem  C.  A  necessary  condition  for  a  maximum  or  minimum 
value  of  <f{x)  at  x  —  £  is  that  £/(£)  =  0.* 

This  follows  at  once  from  Theorem  A.  That  the  condition  is  not  suffi¬ 
cient  is  evident  from  a  glance  at  the  point  C  in  the  figure.  Thus  if  y  =  x3 
then  (p'(x)  =  3x2,  which  vanishes  when  x  =  0.  But  x  =  0  does  not  give 
either  a  maximum  or  a  minimum  of  x3,  as  is  obvious  from  the  form  of  the 
graph  of  x3  (Fig.  10,  p.  51). 

But  there  will  certainly  be  a  maximum  at  x  =  £  if  0'(£)  =  0,  (p'(x)  >  0 
for  all  values  of  x  less  than  but  near  to  £,  and  4>'(x)  <  0  for  all  values 
of  x  greater  than  but  near  to  £:  and  if  the  signs  of  these  two  inequali¬ 
ties  are  reversed  there  will  certainly  be  a  minimum.  For  then  we  can  (by 
Cor.  3  of  §  121)  determine  an  interval  [£  —  <5,  £]  throughout  which  <f{x)  in¬ 
creases  with  x,  and  an  interval  [£,  £  +  5]  throughout  which  it  decreases  as 
x  increases:  and  obviously  this  ensures  that  £>(£)  shall  be  a  maximum. 

*  A  function  which  is  continuous  but  has  no  derivative  may  have  maxima  and  minima. 
We  are  of  course  assuming  the  existence  of  the  derivative. 
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This  result  may  also  be  stated  thus.  If  the  sign  of  <f>'(x)  changes  at 
x  =  £  from  positive  to  negative,  then  x  =  £  gives  a  maximum  of  (f>(x): 
and  if  the  sign  of  4>'{x)  changes  in  the  opposite  sense,  then  x  =  £  gives  a 

minimum. 

123.  There  is  another  way  of  stating  the  conditions  for  a  maximum 
or  minimum  which  is  often  useful.  Let  us  assume  that  <f>(x)  has  a  second 
derivative  (j>"{x)\  this  of  course  does  not  follow  from  the  existence  of  (f>'{x)1 
any  more  than  the  existence  of  <f/(x)  follows  from  that  of  £>(x).  But  in  such 
cases  as  we  are  likely  to  meet  with  at  present  the  condition  is  generally 
satisfied. 

Theorem  D.  If  £/(£)  =  0  and  </>"(£)  ^  0,  then  4>(x)  has  a  maximum 
or  minimum  at  x  =  £,  a  maximum  if  <j>"(£)  <  0,  a  minimum  if  0/;(£)  >  0. 

Suppose,  e.g.,  that  <f>"(£)  <  0.  Then,  by  Theorem  A,  <f>'(x)  is  negative 
when  x  is  less  than  £  but  sufficiently  near  to  £,  and  positive  when  x  is 
greater  than  £  but  sufficiently  near  to  £.  Thus  x  =  £  gives  a  maximum. 

124.  In  what  has  preceded  (apart  from  the  last  paragraph)  we  have  as¬ 
sumed  simply  that  <f{x)  has  a  derivative  for  all  values  of  x  in  the  interval  under 
consideration.  If  this  condition  is  not  fulfilled  the  theorems  cease  to  be  true. 
Thus  Theorem  B  fails  in  the  case  of  the  function 

y  =  1  — 

where  the  square  root  is  to  be  taken  positive.  The  graph  of  this  function  is 
shown  in  Fig.  40.  Here  (/>(—!)  =  </>(l)  =  0:  but  f//(x),  as  is  evident  from  the 
figure,  is  equal  to  1  if  x  is  negative  and  to  —1  if  x  is  positive,  and  never  vanishes. 
There  is  no  derivative  for  x  =  0,  and  no  tangent  to  the  graph  at  P.  And  in  this 
case  x  =  0  obviously  gives  a  maximum  of  cj>(x),  but  £/(0),  as  it  does  not  exist, 
cannot  be  equal  to  zero,  so  that  the  test  for  a  maximum  fails. 

The  bare  existence  of  the  derivative  <f'(x),  however,  is  all  that  we  have  as¬ 
sumed.  And  there  is  one  assumption  in  particular  that  we  have  not  made,  and 
that  is  that  <f'(x)  itself  is  a  continuous  function.  This  raises  a  rather  subtle  but 
still  a  very  interesting  point.  Can  a  function  <f(x)  have  a  derivative  for  all  values 
of  x  which  is  not  itself  continuous?  In  other  words  can  a  curve  have  a  tangent 
at  every  point,  and  yet  the  direction  of  the  tangent  not  vary  continuously?  The 
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reader,  if  he  considers  what  the  question  means  and  tries  to  answer  it  in  the 
light  of  common  sense,  will  probably  incline  to  the  answer  No.  It  is,  however, 
not  difficult  to  see  that  this  answer  is  wrong. 

Consider  the  function  0(x)  defined,  when  x  ^  0,  by  the  equation 

0(.t)  =  x2  sin(l/x); 


and  suppose  that  0(0)  =  0.  Then  0(x)  is  continuous  for  all  values  of  x.  If  x  ^  0 
then 

0/(x)  =  2xsin(l/x)  —  cos(l/x); 


while 


0'(O)  =  Jim 

h — >-0 


h?  sin(l/h) 
h 


=  0. 


Thus  0'(x)  exists  for  all  values  of  x.  But  0,(x)  is  discontinuous  for  x  =  0;  for 
2xsin(l/x)  tends  to  0  as  x  — >  0,  and  cos(l/x)  oscillates  between  the  limits  of 
indetermination  —1  and  1,  so  that  4>'(x)  oscillates  between  the  same  limits. 

What  is  practically  the  same  example  enables  us  also  to  illustrate  the  point 
referred  to  at  the  end  of  §  121.  Let 


<t>(x)  =  x2  sin(l/x)  +  ax, 


where  0  <  a  <  1,  when  x  /  0,  and  0(0)  =  0.  Then  0'(O)  =  a  >  0.  Thus  the 
conditions  of  Theorem  A  of  §  121  are  satisfied.  But  if  x  ^  0  then 


0/(x)  =  2xsin(l/x)  —  cos(l/x)  +  a, 
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which  oscillates  between  the  limits  of  indetermination  a  —  1  and  a  +  1  as  x  — >  0. 
As  a  —  1  <  0,  we  can  find  values  of  x,  as  near  to  0  as  we  like,  for  which  (j)'{x)  <  0; 
and  it  is  therefore  impossible  to  find  any  interval,  including  x  =  0,  throughout 
which  4>(x)  is  a  steadily  increasing  function  of  x. 

It  is,  however,  impossible  that  4>\x)  should  have  what  was  called  in  Ch.  V 
(Ex.  XXXVII.  18)  a  ‘simple’  discontinuity;  e.g.  that  (j>'(x)  — >  a  when  x  — >  +0, 
(j)\x)  — >  b  when  x  ->  —0,  and  r//(0)  =  c,  unless  a  =  b  =  c,  in  which  case  <f/(x)  is 
continuous  for  x  =  0.  For  a  proof  see  §  125,  Ex.  xlvii.  3. 


Examples  XLVI.  1.  Verify  Theorem  B  when  </>(x )  =  (x  —  a)m(x  —  b)n 
or  cj)(x )  =  (x  —  a)m(x  —  b)n(x  —  c)p,  where  m,  n ,  p  are  positive  integers  and 
a  <  b  <  c. 

[The  first  function  vanishes  for  x  =  a  and  x  =  b.  And 

4>'{x)  =  ( x  —  a)m_1(x  —  6)n_1{(m.  +  n)x  —  mb  —  na} 

vanishes  for  x  =  ( mb  +  na)/(m  +  n),  which  lies  between  a  and  b.  In  the  second 
case  we  have  to  verify  that  the  quadratic  equation 

(m  +  n  +  p)x2  —  {m(b  +  c)  +  n(c  +  a)  +  p(a  +  b)}x  +  mbc  +  nca  +  pab  =  0 

has  roots  between  a  and  b  and  between  b  and  c.\ 

2.  Show  that  the  polynomials 

2x3  +  3x2  -  12x  +  7,  3x4  +  8x3  -  6x2  -  24x  +  19 
are  positive  when  x  >  1. 

3.  Show  that  x  —  sin  x  is  an  increasing  function  throughout  any  interval  of 
values  of  x,  and  that  tanx  —  x  increases  as  x  increases  from  —  to  ^7r.  For 
what  values  of  a  is  ax  —  sinx  a  steadily  increasing  or  decreasing  function  of  x? 

4.  Show  that  tanx  —  x  also  increases  from  x  =  to  x  =  37r,  from  x  =  |vr 
to  x  =  1 7 r,  and  so  on,  and  deduce  that  there  is  one  and  only  one  root  of  the 
equation  tanx  =  x  in  each  of  these  intervals  (cf.  Ex.  XVII.  4). 

5.  Deduce  from  Ex.  3  that  sinx  —  x  <  0  if  x  >  0,  from  this  that 
cosx  —  1  +  t;X2  >  0,  and  from  this  that  sinx  —  x  +  |x3  >  0.  And,  generally, 
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prove  that  if 


„2  m 


c2m  =  cosx  -  1  +  — - (-l)r 


S2m+ 1  =  sinx  -  x  +  — - (-1)' 


(2  m) !  ’ 

x2m+l 

(2m  +  1)! 


and  x  >  0,  then  C2m  and  £2m+i  are  positive  or  negative  according  as  m  is  odd 
or  even. 

6.  If  f(x)  and  f"(x)  are  continuous  and  have  the  same  sign  at  every  point 
of  an  interval  [a,  b\,  then  this  interval  can  include  at  most  one  root  of  either  of 
the  equations  f(x)  =  0,  f'(x)  =  0. 

7.  The  functions  u,  v  and  their  derivatives  v! ,  v'  are  continuous  throughout 
a  certain  interval  of  values  of  x,  and  uv'  —  u'v  never  vanishes  at  any  point  of 
the  interval.  Show  that  between  any  two  roots  of  u  =  0  lies  one  of  v  =  0,  and 
conversely.  Verify  the  theorem  when  u  =  cosx,  v  =  since. 

[If  v  does  not  vanish  between  two  roots  of  u  =  0,  say  a.  and  /3,  then  the 
function  u/v  is  continuous  throughout  the  interval  [a,/3]  and  vanishes  at  its 
extremities.  Hence  (u/v)'  =  (u'v  —  uv')/v2  must  vanish  between  a  and  /3,  which 
contradicts  our  hypothesis.] 

8.  Determine  the  maxima  and  minima  (if  any)  of  (x  —  l)2(x  +  2),  x3  —  3x, 
2x 3  —  3x2  —  36x  +  10,  4x3  —  18x2  +  27x  —  7,  3x4  —  4x3  +  1,  x5  —  15x3  +  3.  In 
each  case  sketch  the  form  of  the  graph  of  the  function. 

[Consider  the  last  function,  for  example.  Here  cj)'(x)  =  5x2(x2  —  9),  which 
vanishes  for  x  =  —  3,  x  =  0,  and  x  =  3.  It  is  easy  to  see  that  x  =  —  3  gives  a 
maximum  and  x  =  3  a  minimum,  while  x  =  0  gives  neither,  as  </>'(x)  is  negative 
on  both  sides  of  x  =  0.] 

9.  Discuss  the  maxima  and  minima  of  the  function  (x  —  a)m(x  —  b)n,  where 
m  and  n  are  any  positive  integers,  considering  the  different  cases  which  occur 
according  as  rn  and  n  are  odd  or  even.  Sketch  the  graph  of  the  function. 

10.  Discuss  similarly  the  function  (x  —  a)(x  —  b)2(x  —  c)3,  distinguishing  the 
different  forms  of  the  graph  which  correspond  to  different  hypotheses  as  to  the 
relative  magnitudes  of  a,  b,  c. 

11.  Show  that  (ax  +  b) /(ex  +  d)  has  no  maxima  or  minima,  whatever  values 
a,  b,  c,  d  may  have.  Draw  a  graph  of  the  function. 
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12.  Discuss  the  maxima  and  minima  of  the  function 

y  =  ( ax 2  +  2  bx  +  c)/  ( Ax 2  +  2  Bx  +  C), 

when  the  denominator  has  complex  roots. 

[We  may  suppose  a  and  A  positive.  The  derivative  vanishes  if 

(ax  +  b)(Bx  +  C)  —  (Ax  +  B)(bx  +  c)  =  0.  (1) 

This  equation  must  have  real  roots.  For  if  not  the  derivative  would  always  have 
the  same  sign,  and  this  is  impossible,  since  y  is  continuous  for  all  values  of  x, 
and  y  — >  a/ A  as  x  — >  +oo  or  x  — »  — oo.  It  is  easy  to  verify  that  the  curve  cuts 
the  line  y  =  a/ A  in  one  and  only  one  point,  and  that  it  lies  above  this  line  for 
large  positive  values  of  x,  and  below  it  for  large  negative  values,  or  vice  versa , 
according  as  6/a  >  B/A  or  6/a  <  B/A.  Thus  the  algebraically  greater  root 
of  (1)  gives  a  maximum  if  6/a  >  B/A,  a  minimum  in  the  contrary  case.] 

13.  The  maximum  and  minimum  values  themselves  are  the  values  of  A  for 
which  ax2  +  2bx+c—  X(Ax2  +  2Bx+C)  is  a  perfect  square.  [This  is  the  condition 
that  y  =  A  should  touch  the  curve.] 

14.  In  general  the  maxima  and  maxima  of  R(x)  =  P(x)/Q(x)  are  among  the 
values  of  A  obtained  by  expressing  the  condition  that  P(x)  —  A Q(x)  =  0  should 
have  a  pair  of  equal  roots. 

15.  If  Ax2  +  2 Bx  +  (7  =  0  has  real  roots  then  it  is  convenient  to  proceed  as 
follows.  We  have 


y  -  (a/ A)  =  (Ax  +  y)/{A(Ax2  +  2 Bx  +  (7)}, 

where  A  =  bA  —  aB,  y  =  cA  —  aC.  Writing  further  £  for  Ax  +  y  and  rj  for 
(A/\2)(Ay  —  a),  we  obtain  an  equation  of  the  form 

v  =  £/{(£-?)(£-?)}• 


This  transformation  from  (x,  y)  to  (£,  r/)  amounts  only  to  a  shifting  of  the 
origin,  keeping  the  axes  parallel  to  themselves,  a  change  of  scale  along  each  axis, 
and  (if  A  <  0)  a  reversal  in  direction  of  the  axis  of  abscissae;  and  so  a  minimum 
of  y,  considered  as  a  function  of  x,  corresponds  to  a  minimum  of  y  considered 
as  a  function  of  £,  and  vice  versa,  and  similarly  for  a  maximum. 
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The  derivative  of  q  with  respect  to  £  vanishes  if 

(£  -  p)(i  -  q)  -  £(£  -  p)  - l(£  -  9)  =  0, 

or  if  £2  =  pq.  Thus  there  are  two  roots  of  the  derivative  if  p  and  q  have  the  same 
sign,  none  if  they  have  opposite  signs.  In  the  latter  case  the  form  of  the  graph 


When  p  and  q  are  positive  the  general  form  of  the  graph  is  as  shown  in 
Fig  41b,  and  it  is  easy  to  see  that  £  =  yfpq  gives  a  maximum  and  £  =  — y/pq  a 
minimum.* 

In  the  particular  case  in  which  p  =  q  the  function  is 

v  =  £/(£,-p)2, 

and  its  graph  is  of  the  form  shown  in  Fig.  41c. 

The  preceding  discussion  fails  if  A  =  0,  i.e.  if  a/ A  =  b/B.  But  in  this  case 
we  have 

y  —  (a/ A)  =  p/{A{Ax2  +  2  Bx  +  C)} 

=  p/{A2(x  -  x{){x  -  x2)}, 

say,  and  dy/dx  =  0  gives  the  single  value  x  =  \{x\  +  x2)-  On  drawing  a  graph 
it  becomes  clear  that  this  value  gives  a  maximum  or  minimum  according  as  p  is 
positive  or  negative.  The  graph  shown  in  Fig.  42  corresponds  to  the  former  case. 


*The  maximum  is  —  1  /{y/p—  y/q)'2,  the  minimum  —  l/(y/p+y/q)2,  of  which  the  latter 
is  the  greater. 


[VI :  124] 


DERIVATIVES  AND  INTEGRALS 


272 


[A  full  discussion  of  the  general  function  y  =  (ax2  +  2 bx+c)/ (Ax2  +  2 Bx  +  C) , 
by  purely  algebraical  methods,  will  be  found  in  Chrystal’s  Algebra,  vol  i, 
pp.  464-7.] 

16.  Show  that  (x  —  a)(x  —  f3)/(x  —  7)  assumes  all  real  values  as  x  varies,  if 
7  lies  between  a  and  j3,  and  otherwise  assumes  all  values  except  those  included 
in  an  interval  of  length  4 y/\a  —  7I |/3  —  7|. 


17.  Show  that 

x2  +  2  x  +  c 
^  x2  +  4x  +  3c 

can  assume  any  real  value  if  0  <  c  <  1,  and  draw  a  graph  of  the  function  in  this 
case.  (Math.  Trip.  1910.) 

18.  Determine  the  function  of  the  form  (ax2  +  2 bx  +  c)/(Ax2  +  2 Bx  +  C ) 
which  has  turning  values  (i.e.  maxima  or  minima)  2  and  3  when  x  =  1  and 
x  =  —  1  respectively,  and  has  the  value  2.5  when  x  =  0.  (Math.  Trip.  1908.) 

19.  The  maximum  and  minimum  of  (x  +  a)(x  +  b)/(x  —  a)(x  —  b ),  where 
a  and  b  are  positive,  are 


fVa  +  VbV 
\Vh-Vb) 


20.  The  maximum  value  of  (x  —  l)2/(x  +  l)3  is  . 

21.  Discuss  the  maxima  and  minima  of 

x(x  —  l)/(x 2  +  3x  +  3),  x4/  (x  —  l)(x  —  3)3, 

(x  -  l)2(3x2  -  2x  -  37) /(x  +  5)2(3x2  -  14x  -  1). 

(Math.  Trip.  1898.) 
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[If  the  last  function  be  denoted  by  P(x)/Q(x),  it  will  be  found  that 
P'Q  -  PQ'  =  72(x  -  7)(x  -  3)(x  -  l)(x  +  l)(x  +  2)(x  +  5).] 

22.  Find  the  maxima  and  minima  of  acosx  +  b  sin  re.  Verify  the  result  by 
expressing  the  function  in  the  form  A  cos(x  —  a). 

23.  Find  the  maxima  and  minima  of 

a2  cos2  x  +  b2  sin2  x,  A  cos2  x  +  2 H  cos  x  sin  x  +  B  sin2  x. 

24.  Show  that  sin(x  +  a)/ sin  (a:  +  b)  has  no  maxima  or  minima.  Draw  a 
graph  of  the  function. 

25.  Show  that  the  function 

sin2  x 

T— 7 - 7 ; 7 - TT  (0  <  a  <  b  <  7r) 

sm(x  +  a)  sm(x  +  b) 

has  an  infinity  of  minima  equal  to  0  and  of  maxima  equal  to 

—4  sin  a  sin  6/  sin2  (a  —  b). 

(Math.  Trip.  1909.) 

26.  The  least  value  of  a 2  sec2  x  +  b2  cosec2  x  is  (a  +  b)2. 

27.  Show  that  tan3xcot2x  cannot  lie  between  |  and 

28.  Show  that,  if  the  sum  of  the  lengths  of  the  hypothenuse  and  another  side 

of  a  right-angled  triangle  is  given,  then  the  area  of  the  triangle  is  a  maximum 
when  the  angle  between  those  sides  is  60°.  (Math.  Trip.  1909.) 

29.  A  line  is  drawn  through  a  fixed  point  (a,  b)  to  meet  the  axes  OX,  OY 
in  P  and  Q.  Show  that  the  minimum  values  of  PQ,  OP  +  OQ,  and  OP  ■  OQ 
are  respectively  (a2/3  +  62/3)3/2,  (^/a  +  Vb)2,  and  4a6. 

30.  A  tangent  to  an  ellipse  meets  the  axes  in  P  and  Q.  Show  that  the  least 
value  of  PQ  is  equal  to  the  sum  of  the  semiaxes  of  the  ellipse. 

31.  Find  the  lengths  and  directions  of  the  axes  of  the  conic 

ax 2  +  2  hxy  +  by2  =  1. 

[The  length  r  of  the  semi-diameter  which  makes  an  angle  9  with  the  axis 
of  x  is  given  by 

1/r2  =  a  cos2  9  +  2 h  cos  9  sin  9  +  b  sin2  9. 
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The  condition  for  a  maximum  or  minimum  value  of  r  is  tan  2 9  =  2 h/(a  —  b). 
Eliminating  9  between  these  two  equations  we  find 

{a-(l/r2)}{6-(l/r2)}  =  62.] 

32.  The  greatest  value  of  xmyn ,  where  x  and  y  are  positive  and  x  +  y  =  k, 
is 

mmnnkm+n  /  (m  +  n)m+n. 

33.  The  greatest  value  of  ax  +  by,  where  x  and  y  are  positive  and 
x2  +  xy  +  y2  =  3k2,  is 

2  K\/a2  —  ab  +  b2. 

[If  ax  +  by  is  a  maximum  then  a  +  b(dy/dx)  =  0.  The  relation  between 
x  and  y  gives  (2x  +  y)  +  (x  +  2 y){dy/dx)  =  0.  Equate  the  two  values  of  dy/dx] 

34.  If  9  and  0  are  acute  angles  connected  by  the  relation  a  sec  9-\-b  sec  (j>  =  c, 
where  a,  b,  c  are  positive,  then  a  cos  9  +  6cos0  is  a  minimum  when  6  =  0. 

125.  The  Mean  Value  Theorem.  We  can  proceed  now  to  the  proof 
of  another  general  theorem  of  extreme  importance,  a  theorem  commonly 
known  as  ‘  The  Mean  Value  Theorem,'  or  lThe  Theorem  of  the  Mean'. 

Theorem.  If  <f(x)  has  a  derivative  for  all  values  of  x  in  the  inter¬ 
val  [ a,b ],  then  there  is  a  value  f  of  x  between  a  and  b,  such  that 

0(6)  -0(a)  =  (6  —  a)0'(£). 


Before  we  give  a  strict  proof  of  this  theorem,  which  is  perhaps  the  most 
important  theorem  in  the  Differential  Calculus,  it  will  be  well  to  point 
out  its  obvious  geometrical  meaning.  This  is  simply  (see  Fig.  43)  that  if 
the  curve  APB  has  a  tangent  at  all  points  of  its  length  then  there  must 
be  a  point,  such  as  P,  where  the  tangent  is  parallel  to  AB.  For  0'(£)  is 
the  tangent  of  the  angle  which  the  tangent  at  P  makes  with  OX,  and 
{0(6)  —  0(a)  }/(6  —  a)  the  tangent  of  the  angle  which  AB  makes  with  OX. 

It  is  easy  to  give  a  strict  analytical  proof.  Consider  the  function 

-  7— r{0O)  -0(a)}, 

U  CL 


0(6)  -  0(x) 
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which  vanishes  when  x  =  a  and  x  —  b.  It  follows  from  Theorem  B  of  §  121 
that  there  is  a  value  £  for  which  its  derivative  vanishes.  But  this  derivative 

is 

m~  m  _  m 

b  —  a 

which  proves  the  theorem.  It  should  be  observed  that  it  has  not  been 
assumed  in  this  proof  that  (j>'{x)  is  continuous. 

It  is  often  convenient  to  express  the  Mean  Value  Theorem  in  the  form 

4>(b)  =  4>(a)  +  (b  —  a)4>' {a  +  9{b  —  a)}, 

where  6  is  a  number  lying  between  0  and  1.  Of  course  a  +  6{b  —  a)  is 
merely  another  way  of  writing  ‘some  number  £  between  a  and  b\  If  we  put 
b  =  a  +  h  we  obtain 


4>(a  +  h)  =  4>(a)  +  h(j)\a  +  6h), 
which  is  the  form  in  which  the  theorem  is  most  often  quoted. 


Examples  XLVII.  1.  Show  that 

£>(&)  -  <f>{x)  -  j — -{(j){b)  -  0(a)} 
b  —  a 


is  the  difference  between  the  ordinates  of  a  point  on  the  curve  and  the  corre¬ 
sponding  point  on  the  chord. 
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2.  Verify  the  theorem  when  0(x)  =  x 2  and  when  <f(x)  =  x3 . 

[In  the  latter  case  we  have  to  prove  that  (b3  —  a3)/(b  —  a)  =  3£2,  where 
a  <  £  <  6;  i.e.  that  if  |(62  +  ab  +  a2)  =  £2  then  £  lies  between  a  and  b.} 

3.  Establish  the  theorem  stated  at  the  end  of  §  124  by  means  of  the  Mean 
Value  Theorem. 

[Since  0'( 0)  =  c,  we  can  find  a  small  positive  value  of  x  such  that 

{4>(x)  —  0( 0)}/x  is  nearly  equal  to  c;  and  therefore,  by  the  theorem,  a  small 

positive  value  of  £  such  that  0'(£)  is  nearly  equal  to  c,  which  is  inconsistent 

with  lim  (f)' (x)  =  a,  unless  a  =  c.  Similarly  b  =  c.] 

£->•  +  0 

4.  Use  the  Mean  Value  Theorem  to  prove  Theorem  (6)  of  §  113,  assuming 
that  the  derivatives  which  occur  are  continuous. 

[The  derivative  of  F{f(x)}  is  by  definition 

lim  F{f(x  +  h)}  —  F{f(x)}  ' 
h 

But,  by  the  Mean  Value  Theorem,  f{x  +  h )  =  f(x)  +  hf(^),  where  £  is  a  number 
lying  between  x  and  x  +  h.  And 

F{f(x)  +  hf(0}  =  F{f{*)}  +  hf'(0  F\Ci), 

where  £i  is  a  number  lying  between  f(x )  and  /(x,)  +  /i//(£).  Hence  the  derivative 
of  F{f(x)}  is 

lim  /'(£)F'(£1)  =  //(x)F'{/(x)}, 
since  £  — >  x  and  £i  — >  f(x)  as  h  — >  0.] 

126.  The  Mean  Value  Theorem  furnishes  us  with  a  proof  of  a  result 
which  is  of  great  importance  in  what  follows:  if  <f>'(x)  =  0,  throughout 
a  certain  interval  of  values  of  x,  then  <f(x)  is  constant  throughout  that 
interval. 

For,  if  a  and  b  are  any  two  values  of  x  in  the  interval,  then 

4>(b)  —  0(a)  =  (b  —  a)0/{a  +  9{b  —  a)}  =  0. 

An  immediate  corollary  is  that  if  <f'(x)  =  0'( x ),  throughout  a  certain 
interval,  then  the  functions  <f>{x)  and  if(x)  differ  throughout  that  interval 
by  a  constant. 
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127.  Integration.  We  have  in  this  chapter  seen  how  we  can  find  the 
derivative  of  a  given  function  <f>(x)  in  a  variety  of  cases,  including  all  those  of 
the  commonest  occurrence.  It  is  natural  to  consider  the  converse  question, 
that  of  determining  a  function  whose  derivative  is  a  given  function. 

Suppose  that  if{x)  is  the  given  function.  Then  we  wish  to  determine 
a  function  such  that  <j}'{x)  =  if{x).  A  little  reflection  shows  us  that  this 
question  may  really  be  analysed  into  three  parts. 

(1)  In  the  first  place  we  want  to  know  whether  such  a  function  as  (j>{x) 
actually  exists.  This  question  must  be  carefully  distinguished  from  the 
question  as  to  whether  (supposing  that  there  is  such  a  function)  we  can 
find  any  simple  formula  to  express  it. 

(2)  We  want  to  know  whether  it  is  possible  that  more  than  one  such 
function  should  exist,  i.e.  we  want  to  know  whether  our  problem  is  one 
which  admits  of  a  unique  solution  or  not;  and  if  not,  we  want  to  know 
whether  there  is  any  simple  relation  between  the  different  solutions  which 
will  enable  us  to  express  all  of  them  in  terms  of  any  particular  one. 

(3)  If  there  is  a  solution,  we  want  to  know  how  to  find  an  actual  ex¬ 
pression  for  it. 

It  will  throw  light  on  the  nature  of  these  three  distinct  questions  if 
we  compare  them  with  the  three  corresponding  questions  which  arise  with 
regard  to  the  differentiation  of  functions. 

(1)  A  function  (j>{x)  may  have  a  derivative  for  all  values  of  x,  like  xm, 
where  m  is  a  positive  integer,  or  sin  x.  It  may  generally,  but  not  always  have 
one,  like  tyx  or  tan  x  or  sec  x.  Or  again  it  may  never  have  one:  for  example, 
the  function  considered  in  Ex.  XXXVII.  20,  which  is  nowhere  continuous,  has 
obviously  no  derivative  for  any  value  of  x.  Of  course  during  this  chapter  we 
have  confined  ourselves  to  functions  which  are  continuous  except  for  some 
special  values  of  x.  The  example  of  the  function  tfx,  however,  shows  that 
a  continuous  function  may  not  have  a  derivative  for  some  special  value 
of  x,  in  this  case  x  —  0.  Whether  there  are  continuous  functions  which 
never  have  derivatives,  or  continuous  curves  which  never  have  tangents,  is 
a  further  question  which  is  at  present  beyond  us.  Common-sense  says  No: 
but,  as  we  have  already  stated  in  §  111,  this  is  one  of  the  cases  in  which 
higher  mathematics  has  proved  common-sense  to  be  mistaken. 
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But  at  any  rate  it  is  clear  enough  that  the  question  ‘has  <p(x)  a  deriva¬ 
tive  <p' {x)T  is  one  which  has  to  be  answered  differently  in  different  cir¬ 
cumstances.  And  we  may  expect  that  the  converse  question  ‘is  there  a 
function  <f{x)  of  which  if{x)  is  the  derivative?’  will  have  different  answers 
too.  We  have  already  seen  that  there  are  cases  in  which  the  answer  is  No: 
thus  if  ip{x)  is  the  function  which  is  equal  to  a,  b,  or  c  according  as  x  is 
less  than,  equal  to,  or  greater  than  0,  then  the  answer  is  No  (Ex.  XLVII.  3), 
unless  a  =  b  =  c. 

This  is  a  case  in  which  the  given  function  is  discontinuous.  In  what 
follows,  however,  we  shall  always  suppose  if{x)  continuous.  And  then  the 
answer  is  Yes:  if  ijj(x)  is  continuous  then  there  is  always  a  function  <ffx) 
such  that  (j)'{x)  =  if(x).  The  proof  of  this  will  be  given  in  Cli.  VII. 

(2)  The  second  question  presents  no  difficulties.  In  the  case  of  differ¬ 
entiation  we  have  a  direct  definition  of  the  derivative  which  makes  it  clear 
from  the  beginning  that  there  cannot  possibly  be  more  than  one.  In  the 
case  of  the  converse  problem  the  answer  is  almost  equally  simple.  It  is  that 
if  «p(x)  is  one  solution  of  the  problem  then  <f>(x)  +  C  is  another,  for  any 
value  of  the  constant  C ,  and  that  all  possible  solutions  are  comprised  in 
the  form  <f>(x)  +  C.  This  follows  at  once  from  §  126. 

(3)  The  practical  problem  of  actually  finding  4>'{x)  is  a  fairly  simple 
one  in  the  case  of  any  function  defined  by  some  finite  combination  of  the 
ordinary  functional  symbols.  The  converse  problem  is  much  more  difficult. 
The  nature  of  the  difficulties  will  appear  more  clearly  later  on. 

Definitions.  If  fj(x)  is  the  derivative  of  <f>(x),  then  we  call  <f>(x )  an 
integral  or  integral  function  of  if{x).  The  operation  of  forming  if(x) 
from  4>(x)  we  call  integration. 

We  shall  use  the  notation 


Hx) 


I^(x)dx- 


It  is  hardly  necessary  to  point  out  that  J ...  dx  like  d/dx  must,  at  present 
at  any  rate,  be  regarded  purely  as  a  symbol  of  operation:  the  f  and  the  dx 
no  more  mean  anything  when  taken  by  themselves  than  do  the  d  and  dx 
of  the  other  operative  symbol  d/dx. 
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128.  The  practical  problem  of  integration.  The  results  of  the 
earlier  part  of  this  chapter  enable  us  to  write  down  at  once  the  integrals  of 
some  of  the  commonest  functions.  Thus 


(1) 


These  formulae  must  be  understood  as  meaning  that  the  function  on 
the  right-hand  side  is  one  integral  of  that  under  the  sign  of  integration. 
The  most  general  integral  is  of  course  obtained  by  adding  to  the  former  a 
constant  C,  known  as  the  arbitrary  constant  of  integration. 

There  is  however  one  case  of  exception  to  the  first  formula,  that  in 
which  m  =  —1.  In  this  case  the  formula  becomes  meaningless,  as  is  only 
to  be  expected,  since  we  have  seen  already  (Ex.  xlii.  4)  that  1/x  cannot 
be  the  derivative  of  any  polynomial  or  rational  fraction. 

That  there  really  is  a  function  F(x)  such  that  DxF(x)  =  1/x  will 
be  proved  in  the  next  chapter.  For  the  present  we  shall  be  content  to 
assume  its  existence.  This  function  F(x)  is  certainly  not  a  polynomial 
or  rational  function;  and  it  can  be  proved  that  it  is  not  an  algebraical 
function.  It  can  indeed  be  proved  that  F(x)  is  an  essentially  new  function, 
independent  of  any  of  the  classes  of  functions  which  we  have  considered  yet, 
that  is  to  say  incapable  of  expression  by  means  of  any  finite  combination 
of  the  functional  symbols  corresponding  to  them.  The  proof  of  this  is 
unfortunately  too  detailed  and  tedious  to  be  inserted  in  this  book;  but 
some  further  discussion  of  the  subject  will  be  found  in  Ch.  IX,  where  the 
properties  of  F(x)  are  investigated  systematically. 

Suppose  first  that  x  is  positive.  Then  we  shall  write 


(2) 


and  we  shall  call  the  function  on  the  right-hand  side  of  this  equation  the 
logarithmic  function:  it  is  defined  so  far  only  for  positive  values  of  x. 

Next  suppose  x  negative.  Then  —  x  is  positive,  and  so  log(— x)  is  defined 
by  what  precedes.  Also 


—x  x 


-1  1 
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so  that,  when  x  is  negative, 


I  —  =  log(-z).  (3) 

J  x 

The  formulae  (2)  and  (3)  may  be  united  in  the  formulae 

f  dx 

—  =  log(±a;)  =  log  hi,  (4) 

J  x 

where  the  ambiguous  sign  is  to  be  chosen  so  that  ±x  is  positive:  these 
formulae  hold  for  all  real  values  of  x  other  than  x  =  0. 

The  most  fundamental  of  the  properties  of  logx  which  will  be  proved  in 
Ch.  IX  are  expressed  by  the  equations 

log  1  =  0,  log(l/x)  =  —  logic,  log  xy  =  log  x  +  log  y, 

of  which  the  second  is  an  obvious  deduction  from  the  first  and  third.  It  is  not 
really  necessary,  for  the  purposes  of  this  chapter,  to  assume  the  truth  of  any  of 
these  formulae;  but  they  sometimes  enable  us  to  write  our  formulae  in  a  more 
compact  form  than  would  otherwise  be  possible. 

It  follows  from  the  last  of  the  formulae  that  log  x 2  is  equal  to  2  log  x  if  x  >  0 
and  to  2  log(— x)  if  x  <  0,  and  in  either  case  to  2  log  |x|.  Either  of  the  formulae  (4) 
is  therefore  equivalent  to  the  formula 

J~  =  llo^x2-  (5) 


The  five  formulae  (l)-(3)  are  the  five  most  fundamental  standard  forms 
of  the  Integral  Calculus.  To  them  should  be  added  two  more,  viz. 


dx 


1  +  x1 


=  arc  tan  x, 


x 


vT3 


=  ±  arc  sin  x. 


xz 


(6) 


See  §  119  for  the  rule  for  determining  the  ambiguous  sign. 
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129.  Polynomials.  All  the  general  theorems  of  §  113  may  of  course 
also  be  stated  as  theorems  in  integration.  Thus  we  have,  to  begin  with, 
the  formulae 


j {f(x)  +  F(x)}  dx  = 

[mdx+fFoux, 

(1) 

j kf(x ) dx  = 

k  J  f(x)  dx. 

(2) 

Here  it  is  assumed,  of  course,  that  the  arbitrary  constants  are  adjusted 
properly.  Thus  the  formula  (1)  asserts  that  the  sum  of  any  integral  of  f{x) 
and  any  integral  of  F(x)  is  an  integral  of  f(x)  +  F(x). 

These  theorems  enable  us  to  write  down  at  once  the  integral  of  any 
function  of  the  form  Avfv(x),  the  sum  of  a  finite  number  of  constant 
multiples  of  functions  whose  integrals  are  known.  In  particular  we  can 
write  down  the  integral  of  any  polynomial :  thus 

[  ( a0xn  +  aixn-1  +  ■  ■  ■  +  an)  dx  =  — —  +  —  +  •  ■  ■  +  anx. 

J  n  + 1  n 


130.  Rational  Functions.  After  integrating  polynomials  it  is  natu¬ 
ral  to  turn  our  attention  next  to  rational  functions.  Let  us  suppose  R(x)  to 
be  any  rational  function  expressed  in  the  standard  form  of  §  117,  viz.  as  the 
sum  of  a  polynomial  n(x)  and  a  number  of  terms  of  the  form  A/(x  —  a)p. 

We  can  at  once  write  down  the  integrals  of  the  polynomial  and  of  all 
the  other  terms  except  those  for  which  p  —  1,  since 

f _ — _ dx  = _ — _ - _ 

J(x-a)P  p-l(x-  a)p~l  ’ 

whether  a  be  real  or  complex  (§  117). 

The  terms  for  which  p  —  1  present  rather  more  difficulty.  It  follows 
immediately  from  Theorem  (6)  of  §  113  that 

f  F'{f(x)}  f\x)  dx  =  F{f(x)}. 


(3) 
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In  particular,  if  we  take  f(x)  =  ax  +  6,  where  a  and  b  are  real,  and  write 
c f>(x )  for  F(x )  and  ip(x)  for  F'(x),  so  that  (p(x )  is  an  integral  of  i[(x),  we 
obtain 


/i[(ax  +  b)  dx  —  -Max  +  b). 
a 


(4) 


Thus,  for  example, 


dx 


ax 


-  log  |  ax  +  61, 
a 


and  in  particular,  if  a  is  real, 


[dx 

/  - =  log  \x  —  a \. 

J  x  —  a 

We  can  therefore  write  down  the  integrals  of  all  the  terms  in  R(x)  for 
which  p  =  1  and  a  is  real.  There  remain  the  terms  for  which  p  =  1  and 
a  is  complex. 

In  order  to  deal  with  these  we  shall  introduce  a  restrictive  hypothesis, 
viz.  that  all  the  coefficients  in  R(x)  are  real.  Then  if  a  —  7  +  Si  is  a 
root  of  Q(x)  =  0,  of  multiplicity  m,  so  is  its  conjugate  a  =  7  —  5i]  and 
if  a  partial  fraction  Ap/(x  —  a)p  occurs  in  the  expression  of  R(x),  so  does 
Ap/(x  —  a)p,  where  Ap  is  conjugate  to  Ap.  This  follows  from  the  nature 
of  the  algebraical  processes  by  means  of  which  the  partial  fractions  can  be 
found,  and  which  are  explained  at  length  in  treatises  on  Algebra.* 

Thus,  if  a  term  (A  +  pi)/(x  —  7  —  5i)  occurs  in  the  expression  of  R(x) 
in  partial  fractions,  so  will  a  term  (A  —  pi)/(x  —  7  +  Si);  and  the  sum  of 
these  two  terms  is 

2{A(ff  —  7)  -fj.6} 

(x  —  7)2  +  S2 

This  fraction  is  in  reality  the  most  general  fraction  of  the  form 

Ax  +  B 
ax2  +  2bx  +  c 


*See,  for  example,  Chrystal’s  Algebra ,  vol.  i,  pp.  151-9. 
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where  b 2  <  ac.  The  reader  will  easily  verify  the  equivalence  of  the  two 
forms,  the  formulae  which  express  A,  /i,  7,  6  in  terms  of  A,  B,  a,  b,  c  being 

X  =  A/2a,  p  =  —  D  /  (2a\fK) ,  7  = —b/a,  5  —  V~A/a , 

where  A  =  ac  —  b2,  and  D  =  aB  —  bA. 

If  in  (3)  we  suppose  F{f(x)}  to  be  log  |/(x)|,  we  obtain 

f  JJfff  dx  =  logl/(^)l;  (5) 

and  if  we  further  suppose  that  f{x)  =  (x  —  A)2  +  r2,  we  obtain 

/  {x2- +  dx  =  los{(*  ~  A>2  +  '|2}- 
And,  in  virtue  of  the  equations  (6)  of  §  128  and  (4)  above,  we  have 

—25 11  ,  ( x  —  A 

: - — - - -  ax  =  —Id  arc  tan  - 

x  —  A )l  +  fi2  \  n 

These  two  formulae  enable  us  to  integrate  the  sum  of  the  two  terms 
which  we  have  been  considering  in  the  expression  of  R(x);  and  we  are  thus 
enabled  to  write  down  the  integral  of  any  real  rational  function,  if  all  the 
factors  of  its  denominator  can  be  determined.  The  integral  of  any  such 
function  is  composed  of  the  sum  of  a  polynomial,  a  number  of  rational 
functions  of  the  type 

A  1 
p  —  1  (x  —  ay-1  ’ 

a  number  of  logarithmic  functions,  and  a  number  of  inverse  tangents. 

It  only  remains  to  add  that  if  a  is  complex  then  the  rational  function 
just  written  always  occurs  in  conjunction  with  another  in  which  A  and  a 
are  replaced  by  the  complex  numbers  conjugate  to  them,  and  that  the  sum 
of  the  two  functions  is  a  real  rational  function. 
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Examples  XLVIII.  1.  Prove  that 


Ax  +  B 


A 


D 


,  „  dx  = — log|A'|  +  . _ 

J  ax2  +  2 bx  +  c  2a  2 ay/— A 

(where  X  =  ax 2  +  bx  +  c)  if  A  <  0,  and 


log 


ax 


+  b-V^ A 


ax 


+  b+y/^A 


Ax  +  B  A  D 

— 7, - ; - dx  =  —  log  A  H - —  arc  tan 

ax2  +  2  bx  +  c  2  a  b  1  1  2ay/A 


ax  +  b 


if  A  >  0,  A  and  D  having  the  same  meanings  as  on  p.  283. 

2.  In  the  particular  case  in  which  ac  =  b2  the  integral  is 

D  A.  . 

- 7 - rxT  +  -  l°g \ax  +  o  . 

a{ax  +  b)  a 


3.  Show  that  if  the  roots  of  Q(x)  =  0  are  all  real  and  distinct,  and  P(x)  is 
of  lower  degree  than  Q(x),  then 

J  R{x)  dx  =  log  I*  -  a|> 

the  summation  applying  to  all  the  roots  a  of  Q(x)  =  0. 

[The  form  of  the  fraction  corresponding  to  a  may  be  deduced  from  the  facts 
that 

Q(x)  .  nf,  x  /  x  .  p(oi) 

Q  (a),  (x  —  a)n(x) 


x  —  a 


as  x  — >  a .] 

4.  If  all  the  roots  of  Q{x)  are  real  and  a  is  a  double  root,  the  other  roots 
being  simple  roots,  and  P{x)  is  of  lower  degree  than  Q(x),  then  the  integral  is 
A/ (x  —  a)  +  A'  log  \x  —  a\  +  ^  B  log  \x  —  /3\,  where 


A=  2P(q)  A,  _  2{3 P'(a)Q"(a)  -  P(a)Q"'(a )} 


Q"(a) 


3{Q"(a)}2 


B  = 


P(P) 

Q'(py 


and  the  summation  applies  to  all  roots  [3  of  Q(x)  =  0  other  than  a. 

5.  Calculate 

r  dx 


{(x  —  1)(.T2  +  l)}2  ' 
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[The  expression  in  partial  fractions  is 

1  1  i  2  —  i  i  2  +  i 

4(x  —  1  )2  2(x  —  1)  8 (x  —  i)2  8 (x  —  i)  8(x  +  i)2  8(x  +  *)  ’ 

and  the  integral  is 

}  -  ^log|x  -  1|  +  llog(x2  +  1)  +  \  arc  tan  x.] 

4(x  —  1)  4  (x2  +  1) 


6.  Integrate 


x  x 

(x  —  a)(x  —  b)(x  —  c )  ’  (x  —  a)2(x  —  b )  ’ 

2 

X  X 

(x2  +  a2)(x2  +  b2)  ’  (x2  +  a2)(x2  T  b)2  ’ 


x 

(x  —  a)2(x  — 


x2(x2  +  a2)  ’ 


x 


6)2’  (x  —  a)3’ 


x(x2  +  a2)2 ' 


7.  Prove  the  formulae: 


dx 


1  +  x4 
r  x2  dx 
1  +  x4 
dx 


1  +  x2  +  x4 


—  /  lQg(1+XV*  +  X2) 

4v/2l  g\l-xV2  +  x2) 

—  I  -\pg(l  +  X'/71  +  x2\ 

4^21  SVl-xV2  +  x2y 


+  2  arc  tan 
+  2  arc  tan 
+  2  arc  tan 


xy/2 

i-x2y /’ 

xV2 

i-x2)y 
xV3  \  1 
1  —  x2  /  J 


131.  Note  on  the  practical  integration  of  rational  functions. 

The  analysis  of  §  130  gives  us  a  general  method  by  which  we  can  find  the  integral 
of  any  real  rational  function  R(x),  provided  we  can  solve  the  equation  Q(x)  =  0. 
In  simple  cases  (as  in  Ex.  5  above)  the  application  of  the  method  is  fairly  simple. 
In  more  complicated  cases  the  labour  involved  is  sometimes  prohibitive,  and 
other  devices  have  to  be  used.  It  is  not  part  of  the  purpose  of  this  book  to  go 
into  practical  problems  of  integration  in  detail.  The  reader  who  desires  fuller 
information  may  be  referred  to  Goursat’s  Cours  d' ’Analyse,  second  ed.,  vol.  i, 
pp.  246  et  seq.,  Bertrand’s  Calcul  Integral ,  and  Dr  Bromwich’s  tract  Elementary 
Integrals  (Bowes  and  Bowes,  1911). 
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If  the  equation  Q{x)  =  0  cannot  be  solved  algebraically,  then  the  method  of 
partial  fractions  naturally  fails  and  recourse  must  be  had  to  other  methods.* 

132.  Algebraical  Functions.  We  naturally  pass  on  next  to  the 
question  of  the  integration  of  algebraical  functions.  We  have  to  consider 
the  problem  of  integrating  y,  where  y  is  an  algebraical  function  of  x.  It  is 
however  convenient  to  consider  an  apparently  more  general  integral,  viz. 

J  R(x,  y )  dx, 

where  R(x ,  y)  is  any  rational  function  of  x  and  y.  The  greater  generality  of 
this  form  is  only  apparent,  since  (Ex.  XIV.  6)  the  function  R(x,y)  is  itself 
an  algebraical  function  of  x.  The  choice  of  this  form  is  in  fact  dictated 
simply  by  motives  of  convenience:  such  a  function  as 

px  +  q  +  y/  ax 2  +  2  bx  +  c 
px  +  q  —  a/  ax 2  +  2  bx  +  c 

is  far  more  conveniently  regarded  as  a  rational  function  of  x  and  the  simple 
algebraical  function  \/ax2  +  2 bx  +  c,  than  directly  as  itself  an  algebraical 
function  of  x. 

133.  Integration  by  substitution  and  rationalisation.  It  follows 

from  equation  (3)  of  §  130  that  if  J  ip(x)  dx  =  <fi(x)  then 

J  Wit)}  f'(t)dt  =  0{/(t)}.  (1) 

This  equation  supplies  us  with  a  method  for  determining  the  integral 
of  ip{x)  in  a  large  number  of  cases  in  which  the  form  of  the  integral  is 
not  directly  obvious.  It  may  be  stated  as  a  rule  as  follows:  put  x  =  f(t), 

*See  the  author’s  tract  “The  integration  of  functions  of  a  single  variable”  ( Cambridge 
Tracts  in  Mathematics ,  No.  2,  second  edition,  1915).  This  does  not  often  happen  in 
practice. 
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where  f(t )  is  any  function  of  a  new  variable  t  which  it  may  be  convenient 
to  choose;  multiply  by  fit),  and  determine  (if  possible )  the  integral  of 
'ififft)}  f'(t );  express  the  result  in  terms  of  x.  It  will  often  be  found  that 
the  function  of  t  to  which  we  are  led  by  the  application  of  this  rule  is  one 
whose  integral  can  easily  be  calculated.  This  is  always  so,  for  example,  if 
it  is  a  rational  function,  and  it  is  very  often  possible  to  choose  the  relation 
between  x  and  t  so  that  this  shall  be  the  case.  Thus  the  integral  of  R(y/x), 
where  R  denotes  a  rational  function,  is  reduced  by  the  substitution  x  =  t2 
to  the  integral  of  2 tR(t2),  i.e.  to  the  integral  of  a  rational  function  of  t. 
This  method  of  integration  is  called  integration  by  rationalisation,  and 
is  of  extremely  wide  application. 

Its  application  to  the  problem  immediately  under  consideration  is  obvi¬ 
ous.  If  we  can  find  a  variable  t  such  that  x  and  y  are  both  rational  functions 
oft,  say  x  =  Ri(t),  y  =  Rail),  then 

J  R(x,y)dx=  I  R{R\(t),  R2(t)}  R[(t)  dt, 

and  the  latter  integral,  being  that  of  a  rational  function  oft,  can  be  calcu¬ 
lated  by  the  methods  of  §  130. 

It  would  carry  us  beyond  our  present  range  to  enter  upon  any  general 
discussion  as  to  when  it  is  and  when  it  is  not  possible  to  find  an  auxiliary 
variable  t  connected  with  x  and  y  in  the  manner  indicated  above.  We  shall 
consider  only  a  few  simple  and  interesting  special  cases. 

134.  Integrals  connected  with  conics.  Let  us  suppose  that 
x  and  y  are  connected  by  an  equation  of  the  form 

ax 2  +  2  hxy  +  by 2  +  2  gx  +  2  fy  +  c  =  0; 

in  other  words  that  the  graph  of  y,  considered  as  a  function  of  a;  is  a  conic. 
Suppose  that  (£,  rj)  is  any  point  on  the  conic,  and  let  x  —  f  =  X,  y  —  rj  =  Y. 
If  the  relation  between  x  and  y  is  expressed  in  terms  of  X  and  Y ,  it  assumes 
the  form 


aX 2  +  2  hXY  +  bY 2  +  2  GX  +  2  FY  =  0, 
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where  F  =  h^+brj+f,  G  =  a£+hr]+g.  In  this  equation  put  Y  =  tX.  It  will 
then  be  found  that  X  and  Y  can  both  be  expressed  as  rational  functions 
of  t,  and  therefore  x  and  y  can  be  so  expressed,  the  actual  formulae  being 

2  (G  +  Ft)  _  __2  t(G  +  Ft) 

a  +  2  ht  +  5f2  ’  ^  ^  a  +  2  ht  +  bt 2 

Hence  the  process  of  rationalisation  described  in  the  last  section  can  be 
carried  out. 

The  reader  should  verify  that 


hx  +  by  +  / 


—  |  (a  +  2  ht  +  bt2) 


dx 
dt  ’ 


so  that 


dx 


hx  +  by  +  / 


=  -2 


dt 

a  +  2  ht  +  bt 2 


When  h2  >  ab  it  is  in  some  ways  advantageous  to  proceed  as  follows. 
The  conic  is  a  hyperbola  whose  asymptotes  are  parallel  to  the  lines 


ax 2  +  2  hxy  +  by2  =  0, 


or 

b(y  -  fix)(y  -  fi'x)  =  0, 
say.  If  we  put  y  —  yx  =  t,  we  obtain 


y  —  /ax  —  t,  y  —  fix 


2  gx  +  2  fy  +  c 
bt 


and  it  is  clear  that  x  and  y  can  be  calculated  from  these  equations  as 
rational  functions  of  t.  We  shall  illustrate  this  process  by  an  application 
to  an  important  special  case. 
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f  dx 

135.  The  integral  /  .  Suppose  in  particular  that 

J  Vax 2  +  2  bx  +  c 

y 2  =  ax 2  +  2 bx  +  c,  where  a  >  0.  It  will  be  found  that,  if  we  put  y  +  xy/a  =  t, 
we  obtain 

dx  _  (t2  +  c)y/a  +  2bt  _  (t2  +  c)y/a  +  2bt 

dt  (ty/a  +  b)2  ’  ^  ty/a  +  b 


and  so 


xy fa  +  y  + 


(1) 


If  in  particular  a  =  1,  b  =  0,  c  =  a2,  or  a  =  1,  6  =  0,  c  =  —a2,  we  obtain 


dx 


\Jx2  +  a2 


=  log{x  +  \/x2  +  a2}, 


log  |x  +  v^2  —  a2 1,  (2) 


equations  whose  truth  may  be  verified  immediately  by  differentiation.  With 
these  formulae  should  be  associated  the  third  formula 

dx  . 

=  arc  sm(x/ a),  (3) 

a-  —  xz 

which  corresponds  to  a  case  of  the  general  integral  of  this  section  in  which  a  <  0. 
In  (3)  it  is  supposed  that  a  >  0;  if  a  <  0  then  the  integral  is  arc sin(x/|a|)  (cf. 
§  119).  In  practice  we  should  evaluate  the  general  integral  by  reducing  it  (as  in 
the  next  section)  to  one  or  other  of  these  standard  forms. 

The  formula  (3)  appears  very  different  from  the  formulae  (2):  the  reader  will 
hardly  be  in  a  position  to  appreciate  the  connection  between  them  until  he  has 
read  Ch.  X. 


136.  The  integral 


Xx  +  /i 


dx.  This  integral  can  be  inte- 


V ax2  +  2  bx  +  c 

grated  in  all  cases  by  means  of  the  results  of  the  preceding  sections.  It  is 
most  convenient  to  proceed  as  follows.  Since 


Xx  +  n  =  (A /a)  (ax  +  b)  +  /i  —  (A  b/a), 

f  ax  +  b  ^  ax2  _j_  2 bx  +  c, 
J  y/ax2  +  2bx  +  c 
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we  have 


(Ax  +  n)  dx 
\J  ax2  +  2  bx  +  c 


—  V  ax 2  +  2  bx  +  c  + 
a 


dx 

V  ax2  +  2  bx  +  c 


In  the  last  integral  a  may  be  positive  or  negative.  If  a  is  positive  we 
put  X\fa  +  (b/y/a)  =  t,  when  we  obtain 


i  r  dt 

yfa  J  Vt2  +  k  ’ 

where  n  =  (ac  —  b2)/a.  If  a  is  negative  we  write  A  for  —a  and  put 
x\J^A.  —  ( b / \/C4)  =  t,  when  we  obtain 


1 


dt 

\J —K  —  t 2 


It  thus  appears  that  in  any  case  the  calculation  of  the  integral  may  be 
made  to  depend  on  that  of  the  integral  considered  in  §  135,  and  that  this 
integral  may  be  reduced  to  one  or  other  of  the  three  forms 


dt 


dt 


dt 


Vo 


f 2 


same 


137.  The  integral  J  (Ax  +  /i)  V  ax2  +  2  bx  +  c  dx.  In  exactly  the 
way  we  find 

[  (Xx  +  VVax2  +  2bx  +  cdx 

(ax2  +  2bx  +  c)3/2  +  ^  J  V ax2  +  2 bx  +  cdx ; 

and  the  last  integral  may  be  reduced  to  one  or  other  of  the  three  forms 

In  order  to  obtain  these  integrals  it  is  convenient  to  introduce  at  this  point 
another  general  theorem  in  integration. 


A_ 

3a 
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138.  Integration  by  parts.  The  theorem  of  integration  by  parts  is 
merely  another  way  of  stating  the  rule  for  the  differentiation  of  a  product 
proved  in  §  113.  It  follows  at  once  from  Theorem  (3)  of  §  113  that 

J  f'(x)F(x)dx  =  f(x)F(x)  —  J  f(x)F'(x)dx. 

It  may  happen  that  the  function  which  we  wish  to  integrate  is  expressible 
in  the  form  f(x)F(x),  and  that  f(x)F'(x)  can  be  integrated.  Suppose, 
for  example,  that  (p(x)  =  xijj(x),  where  i])(x)  is  the  second  derivative  of  a 
known  function  x(x).  Then 

J  (f>(x)  dx  =  j  xx"{x)  dx  =  xx\x)  —  J  x'(x)  dx  =  xx(x)  —  x(x). 

We  can  illustrate  the  working  of  this  method  of  integration  by  applying  it 
to  the  integrals  of  the  last  section.  Taking 


we  obtain 


f(x)  =  ax  +  b,  F(x)  =  \J  ax 2  +  2  bx  +  c  =  y, 

a  J  ydx  =  (ax  +  b)y  —  J  (a:c  T  (jx 

/r 

ydx  +  (ac  —  b 2)  /  — , 

J  y 


so  that 


,  (ax  +  b)y  ac  —  b2 
ydx=  - - - 1- 


dx 

? 

y 


2  a  2  a 

and  we  have  seen  already  (§  135)  how  to  determine  the  last  integral. 
Examples  XLIX.  1.  Prove  that  if  a  >  0  then 

JVMdX  =  ixVx^  +  yioe{x+VM}, 

J  \J x2  —  a2  dx  =  |i\/x2-a2-  ^a2  log  \x  +  \/ x2  —  a2|, 
J  yj a2  —  x2  dx  =  \J a2  —  x2  +  \ a 2  arcsin(x/a). 
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2.  Calculate  the  integrals  f  .  '  =  ,  [  \/  a2  —  x2  dx  by  means  of  the 

J  s/a2  -  x2  J 

substitution  x  =  a  sin#,  and  verify  that  the  results  agree  with  those  obtained  in 
§  135  and  Ex.  1. 

3.  Calculate  J  x(x  +  a)m  dx,  where  m  is  any  rational  number,  in  three 

ways,  viz.  (i)  by  integration  by  parts,  (ii)  by  the  substitution  (x  +  a)m  =  t,  and 
(iii)  by  writing  (x  +  a)  —  a  for  x\  and  verify  that  the  results  agree. 

4.  Prove,  by  means  of  the  substitutions  ax  +  b  =  l/t  and  x  =  1  /it,  that  (in 
the  notation  of  §§  130  and  138) 


/ 


dx 

y3 


ax  +  b 
Ay 


bx  +  c 
Ay 


f  dx 

5.  Calculate  /  — ,  ,  where  b  >  a,  in  three  ways,  viz.  (i)  by  the 

J  y/[x-a){b-x) 

methods  of  the  preceding  sections,  (ii)  by  the  substitution  (6  —  x)/(x  —  a)  =  t2, 
and  (iii)  by  the  substitution  x  =  a  cos2  6  +  5  sin2#;  and  verify  that  the  results 
agree. 

6.  Integrate  \J (x  —  a) (b  —  x )  and  y/(b  —  x)/(x  —  a). 

7.  Show,  by  means  of  the  substitution  2 x  +  a  +  b  =  ^(a  —  b){t2  +  (l/t)2},  or 
by  multiplying  numerator  and  denominator  by  y/x  +  a  —  \J x  +  b,  that  if  a  >  b 
then 

r  (%% 

8.  Find  a  substitution  which  will  reduce  /  - - -77- - - - -tttx  to  the 

J  (x  +  a)3'2  +  (x  -  a)3/2 

integral  of  a  rational  function.  (Math.  Trip.  1899.) 

9.  Show  that  J  R{x,  \J ax  +  b}  dx  is  reduced,  by  the  substitution 
ax  +  b  =  yn,  to  the  integral  of  a  rational  function. 

10.  Prove  that 


f"(x)F(x )  dx  =  f'(x)F(x)  —  f(x)F'(x)  +  /  f(x)F"/x)  dx 


[VI :  139] 


DERIVATIVES  AND  INTEGRALS 


293 


and  generally 

J  f(jl\x)F{x)  dx 

=  f(-n~1\x)F{x )  -  f^-2\x)F'(x)  +  ■■■  +  (-! )n  J  f(x)F^n\x)  dx. 


11.  The  integral  J (1  +  x)pxq  dx,  where  p  and  q  are  rational,  can  be  found 

in  three  cases,  viz.  (i)  if  p  is  an  integer,  (ii)  if  q  is  an  integer,  and  (iii)  if  p  +  q  is 
an  integer.  [In  case  (i)  put  x  =  us,  where  s  is  the  denominator  of  q;  in  case  (ii) 
put  1  +  x  =  ts,  where  s  is  the  denominator  of  p;  and  in  case  (iii)  put  1  +  x  =  xts, 
where  s  is  the  denominator  of  p] 

12.  The  integral  J  xm(axn  +  b)q  dx  can  be  reduced  to  the  preceding  integral 

by  the  substitution  ax 71  =  bt.  [In  practice  it  is  often  most  convenient  to  calculate 
a  particular  integral  of  this  kind  by  a  ‘formula  of  reduction’  (cf.  Misc.  Ex.  39).] 

13.  The  integral  J  R{x,  Vax  +  b,  V^+~d}  dx  can  be  reduced  to  that  of  a 
rational  function  by  the  substitution 

4x  =  -( b/a){t  +  (1/t)}2  -  ( d/c){t  -  (1/t)}2. 

14.  Reduce  J  R(x,  y )  dx,  where  y2(x  —  y)  =  x2,  to  the  integral  of  a  rational 

function.  [Putting  y  =  tx  we  obtain  x  =  l/{t2(l  —  f)},  y  =  l/{f(l  —  t)}.] 

15.  Reduce  the  integral  in  the  same  way  when  (a)  y(x  —  y)2  =  x, 
( b )  (x2  +  y2)2  =  a2(x2  —  y2).  [In  case  (a)  put  x  —  y  =  t:  in  case  (b)  put 
x2  +  y2  =  t(x  —  y),  when  we  obtain 

x  =  a2t(t2  +  a2)/(t4  +  a4),  y  =  a2t(t2  —  a2)/(t4  +  a4).] 

16.  If  y(x  —  y)2  =  x  then 


17.  If  ( x 2  +  y2Y  =  2 c2(x2  —  y2)  then 


dx 


=  -3  lo§ 


y(x2  +  y2  +  c2)  c 2 


x2  +  y2\ 

x-y  J 
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139.  The  general  integral  J  R(x,  y )  dx,  where  y2  =  ax 2  +  2 bx  +  c. 

The  most  general  integral,  of  the  type  considered  in  §  134,  and  associated  with 
the  special  conic  y 2  =  ax 2  +  2 bx  +  c,  is 

J  R(x,y/X)dx,  (1) 

where  X  =  y2  =  ax2  +  2 bx  +  c.  We  suppose  that  R  is  a  real  function. 

The  subject  of  integration  is  of  the  form  P/Q ,  where  P  and  Q  are  polyno¬ 
mials  in  x  and  \[X .  It  may  therefore  be  reduced  to  the  form 


A  +  By/X  _{A  +  BjX){C-Dy/X)  _E  ^ 
c  +  dVx  c2-d2x 

where  A,  B,  ...  are  rational  functions  of  x.  The  only  new  problem  which  arises 
is  that  of  the  integration  of  a  function  of  the  form  Fy/X,  or,  what  is  the  same 
thing,  G/y/X,  where  G  is  a  rational  function  of  x.  And  the  integral 


/F1  (2) 


can  always  be  evaluated  by  splitting  up  G  into  partial  fractions.  When  we  do 
this,  integrals  of  three  different  types  may  arise. 

(i)  In  the  first  place  there  may  be  integrals  of  the  type 


/. rm 

7fdx'  (3) 


where  m  is  a  positive  integer.  The  cases  in  which  m  =  0  or  m  =  1  have  been 
disposed  of  in  §  136.  In  order  to  calculate  the  integrals  corresponding  to  larger 
values  of  m  we  observe  that 


=  (m  -  l)xm~2Vx  + 

dx 


(ax  +  b)xm  1 


axm  +  Pxm~l  +  yxm~2 


where  a,  /3,  7  are  constants  whose  values  may  be  easily  calculated.  It  is  clear 
that,  when  we  integrate  this  equation,  we  obtain  a  relation  between  three  suc¬ 
cessive  integrals  of  the  type  (3).  As  we  know  the  values  of  the  integral  for  m  =  0 
and  m  =  1,  we  can  calculate  in  turn  its  values  for  all  other  values  of  m. 
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In  the  second  place  there  may  be  integrals  of  the  type 

f  dx 


( X  —  p)m  y/X.' 


(4) 


where  p  is  real.  If  we  make  the  substitution  x  —  p  =  1/t  then  this  integral  is 
reduced  to  an  integral  in  t  of  the  type  (3). 

(iii)  Finally,  there  may  be  integrals  corresponding  to  complex  roots  of  the 
denominator  of  G.  We  shall  confine  ourselves  to  the  simplest  case,  that  in  which 
all  such  roots  are  simple  roots.  In  this  case  (cf.  §  130)  a  pair  of  conjugate  complex 
roots  of  G  gives  rise  to  an  integral  of  the  type 


Lx  +  M 


J  ( Ax 2  +  2 Bx  +  C)Vax2  +  2 bx  +  c 
In  order  to  evaluate  this  integral  we  put 

pt  +  v 


dx. 


(5) 


x  = 


t  +  1 


where  p  and  u  are  so  chosen  that 

apu  +  b(p  +  v)  +  c  =  0,  Apu  +  B{p  +  v)  +  C  =  0; 


so  that  p  and  v  are  the  roots  of  the  equation 

(aB  -  bA)i2  -  ( cA  -  aC)i  +  ( bC  -  cB)  =  0. 


This  equation  has  certainly  real  roots,  for  it  is  the  same  equation  as  equation  (1) 
of  Ex.  xlvi.  12;  and  it  is  therefore  certainly  possible  to  find  real  values  of  p  and  u 
fulfilling  our  requirements. 

It  will  be  found,  on  carrying  out  the  substitution,  that  the  integral  (5)  as¬ 
sumes  the  form 


H 


t  dt 

(at2  +  /?)  y/'ft2  +  5 


dt 

(at2  +  /3)  y/'ft2  +  5 


(6) 


The  second  of  these  integrals  is  rationalised  by  the  substitution 


t 

\/lt2  +  5 
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which  gives 


dt 


I 


du 


(at2  +  j3)  \J 'yt2  +  6  J  P  +  (a<5  —  Pj)u2 

Finally,  if  we  put  t  =  1/u  in  the  first  of  the  integrals  (6),  it  is  transformed  into 
an  integral  of  the  second  type,  and  may  therefore  be  calculated  in  the  manner 
just  explained,  viz.  by  putting  u/  y/'y  +  du2  =  u,  i.e.  l/\Z^t2  +  5  =  v* 


Examples  L.  1.  Evaluate 
f  dx  I 


dx 


dx 


J  xy/x2  +  2x  +  3’ 
2.  Prove  that 


(x  —  l)y/x2  +  1'  J  (x  +  l)Vl  +  2x  —  x2 


dx 


x  —  q 


J  (x  -  p)  y/(x  —  p)(x  —  q )  Q~P\  X  -  p 

3.  If  ag2  +  ch 2  =  —  v  <  0  then 


dx 


1 

= - =  arc  tan 


(hx  +  g)  V  ax2  +  c  yV 

dx 


yj  1 y(ax2  +  c) 


4.  Show  that  / 

J  (x  ~  x0)y 

one  or  other  of  the  forms 


ch  —  agx 

,  where  y2  =  ax 2  +  2 bx  +  c,  may  be  expressed  in 


— -log 
2/o 


axx  0  +  b(x  +  x0)  +  c  +  yy0 


x  —  Xq 


1  f  axx 0  +  b(x  +  xo)  +  c 

—  arc  tan  < - 

^0  l  yzo 


according  as  ax$  +  2bxo  +  c  is  positive  and  equal  to  y\  or  negative  and  equal 

to  —  Zq. 


*The  method  of  integration  explained  here  fails  if  a/A  =  b/B\  but  then  the  integral 
may  be  reduced  by  the  substitution  ax  +  b  =  t.  For  further  information  concern¬ 
ing  the  integration  of  algebraical  functions  see  Stolz,  Grundzuge  der  Differential-und- 
integralrechnung ,  vol.  i,  pp.  331  et  seq.;  Bromwich,  Elementary  Integrals  (Bowes  and 
Bowes,  1911).  An  alternative  method  of  reduction  has  been  given  by  Sir  G.  Greenhill: 
see  his  A  Chapter  in  the  Integral  Calculus ,  pp.  12  et  seq.,  and  the  author’s  tract  quoted 
on  p.  286. 
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5.  Show  by  means  of  the  substitution  y  =  \/ax2  +  2 bx  +  c/ (x  —  p )  that 

f  dx  _  f  dy 

J  (x  —  p)  \J ax2  +  2 bx  +  c  J  y7 A y2  —  p  ’ 

where  A  =  ap2  +  2bp  +  c,  p  =  ac  —  b2 .  [This  method  of  reduction  is  elegant  but 
less  straightforward  than  that  explained  in  §  139.] 

6.  Show  that  the  integral 

r  dx 
J  x\/3x2  +  2x  +  1 

is  rationalised  by  the  substitution  x  =  (1  +  y2)/(3  —  y2). 

7.  Calculate 

f  (x  +  1)  dx 


(Math.  Trip.  1911.) 


(x2  +  4)V^2  +  9 


8.  Calculate 


dx 


J  (hx2 +  \2x  +  3)V3xPr+2xzr7' 

[Apply  the  method  of  §  139.  The  equation  satisfied  by  /u  and  v  is 
£2  +  3£  +  2  =  0,  so  that  [i  =  —2,  v  =  —  1,  and  the  appropriate  substitution  is 
x  =  — (2t  +  l)/(f  +  1).  This  reduces  the  integral  to 

f  dt  f  tdt 


(4t2  +  1)  \/9t2  —  4  J  (At2  +  1)V9 12  -  4' 


The  first  of  these  integrals  may  be  rationalised  by  putting  f/\/9t2  —  4  =  u  and 
the  second  by  putting  1/ \/9t2  —  4  =  u.] 

9.  Calculate 


/ 


(2x2 


(x  +  1)  dx 

2x  +  l)V3x2  -  2x  +  l’ 


(x  —  1)  dx 

6x  +  5)\/7x2  -  22x  +  19 ' 


(Math.  Trip.  1911.) 

10.  Show  that  the  integral  J  R(x,  y)  dx ,  where  y2  =  ax 2  +  2bx  +  c,  is  ra¬ 
tionalised  by  the  substitution  t  =  (x  —  p)/(y  +  q),  where  (p,  q )  is  any  point  on 
the  conic  y2  =  ax 2  +  2bx  +  c.  [The  integral  is  of  course  also  rationalised  by  the 
substitution  t  =  (x  —  p) / (y  —  q):  cf.  §  134.] 
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140.  Transcendental  Functions.  Owing  to  the  immense  variety 
of  the  different  classes  of  transcendental  functions,  the  theory  of  their  in¬ 
tegration  is  a  good  deal  less  systematic  than  that  of  the  integration  of 
rational  or  algebraical  functions.  We  shall  consider  in  order  a  few  classes 
of  transcendental  functions  whose  integrals  can  always  be  found. 

141.  Polynomials  in  cosines  and  sines  of  multiples  of  x.  We 

can  always  integrate  any  function  which  is  the  sum  of  a  finite  number  of 
terms  such  as 

A  cosm  ax  sin7”'  ax  cosn  bx  sin"'  bx  ... , 

where  m,  m',  n,  n' ,  . . .  are  positive  integers  and  a,  b,  . . .  any  real  numbers 
whatever.  For  such  a  term  can  be  expressed  as  the  sum  of  a  finite  number 
of  terms  of  the  types 

a  cos{(pa  +  qb  +  . . .  )x},  /3  sin{(pa  +  qb  +  . . .  )x} 

and  the  integrals  of  these  terms  can  be  written  down  at  once. 

Examples  LI.  1.  Integrate  sin3  x  cos2  2x .  In  this  case  we  use  the  for¬ 
mulae 

sin3  x  =  \  (3  sin  x  —  sin  3x) ,  cos2  2x  =  \  (1  +  cos  4x) . 

Multiplying  these  two  expressions  and  replacing  sinxcos4x,  for  example,  by 
^  (sin  5x  —  sin  3x) ,  we  obtain 

Ys  J {7  sin  x  —  5  sin  3x  +  3  sin  5x  —  sin  7x)  dx 

=  —  Yg  COS  X  +  COS  3x  —  ^  COS  5x  +  yjrj  cos  7x. 

The  integral  may  of  course  be  obtained  in  different  forms  by  different  meth¬ 
ods.  For  example 

J  sin3  x cos2  2x  dx  =  J (4cos4  x  —  4cos2  x  +  1)(1  —  cos2  x)  sinx  dx, 
which  reduces,  on  making  the  substitution  cos  x  =  t,  to 

J (4f6  —  8 14  +  5 12  —  1)  dt  =  |  cos7  x  —  |  cos5  x  +  |  cos3  x  —  cos  x. 
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It  may  be  verified  that  this  expression  and  that  obtained  above  differ  only  by  a 
constant. 

2.  Integrate  by  any  method  cos  ax  cos  bx,  sin  ax  sin  bx,  cos  ax  sin  bx,  cos2  x, 
sin3  x,  cos4x,  cosxcos2xcos3x,  cos3  2x  sin2  3x,  cos5  x  sin'  x.  [In  cases  of  this 
kind  it  is  sometimes  convenient  to  use  a  formula  of  reduction  (Misc.  Ex.  39).] 


142.  The  integrals  J  xn  cos  xdx,  J  xn  sin  x  dx  and  associated 

integrals.  The  method  of  integration  by  parts  enables  us  to  generalise 
the  preceding  results.  For 


xn  cos  xdx  =  xn  sin  x  —  n  I  xn  1  sin  x  dx , 


J  x11  sin  x  dx ——xn  cos  x  +  n  J  xn  1  cos  xdx, 

and  clearly  the  integrals  can  be  calculated  completely  by  a  repetition  of 
this  process  whenever  n  is  a  positive  integer.  It  follows  that  we  can  always 

calculate  J  xn  cos  ax  dx  and  J  xn  sin  ax  dx  if  n  is  a  positive  integer;  and 

so,  by  a  process  similar  to  that  of  the  preceding  paragraph,  we  can  calculate 

Jp(X,cosaX,smaX,coste,sinte,...)dX, 

where  P  is  any  polynomial. 


Examples  LII.  1.  Integrate  xsinx,  x2cosx,  x2cos2x,  x2  sin2  x sin2  2x, 
x  sin2  x  cos4  x,  x3  sin3  lx. 

2.  Find  polynomials  P  and  Q  such  that 


J {(3x  —  1)  cosx  +  (1  —  2x)  sinx}  dx  =  P  cosx  +  Qsinx. 


3.  Prove  that  J  xn  cos  xdx  =  Pn  cos  x  +  Qn  sin  x,  where 
Pn  =  nxn~l  —  n(n  —  1  )(n  —  2)x"-3  +  . . . ,  Qn  =  xn  —  n(n  —  l)xn~2  +  . . . . 
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143.  Rational  Functions  of  cosx  and  sin  a;.  The  integral  of  any 
rational  function  of  cos  a:  and  sin  a;  may  be  calculated  by  the  substitution 
tan  \x  =  t.  For 


cos  a;  = 


1-t2 

TTt2’ 


2 1 

sins  =  rr?’ 


dx 

dt 


2 

TTt2’ 


so  that  the  substitution  reduces  the  integral  to  that  of  a  rational  function 
of  t. 


Examples  LIII.  1.  Prove  that 


J  sec  x  dx  =  log  |  sec  x  +  tan  x 


j  cosec  x  dx  =  log  |  tan  1 x 


[Another  form  of  the  first  integral  is  log|tan(^7r  +  ^x)\\  a  third  form  is 
\  log  |(1  +  sinx)/(l  —  sinx)|.] 

2.  /  tanxdx  =  —  log  |  cosx|,  /  cot  x  dx  =  log  |  sin  x| ,  /  sec2  x  dx  =  tan  a;, 


cosec  x  dx  =  —  cot  x,  /  tan  x  sec  x  dx  =  sec  x, 


cot  x  cosec  x  dx  =  —  cosec  x. 


[These  integrals  are  included  in  the  general  form,  but  there  is  no  need  to  use 
a  substitution,  as  the  results  follow  at  once  from  §  119  and  equation  (5)  of  §  130.] 

3.  Show  that  the  integral  of  1  /(a  +  b  cos  x) ,  where  a  +  b  is  positive,  may  be 
expressed  in  one  or  other  of  the  forms 


2 

V a2  —  b2 


arc  tan 


1 

yjb2  —  a 2 


log 


y/b  +  a  +  ty/b  —  a 
yfb  +  a  —  tyjb  —  a 


where  t  =  tan  ^x,  according  as  a2  >  b 2  or  a?  <b2 .  If  a 2  =  b2  then  the  integral 
reduces  to  a  constant  multiple  of  that  of  sec2  \x  or  cosec2  and  its  value  may 
at  once  be  written  down.  Deduce  the  forms  of  the  integral  when  a+b  is  negative. 

4.  Show  that  if  y  is  defined  in  terms  of  x  by  means  of  the  equation 

(a  +  b  cos  x)(a  —  b  cos  y)  =  a2  —  62 , 

where  a  is  positive  and  a2  >  b2,  then  as  x  varies  from  0  to  7r  one  value  of  y  also 
varies  from  0  to  n.  Show  also  that 

yj a2  —  b2  sin  y  sin  x  dx  sin  y 

a  —  b  cos  y  ’  a  +  b  cos  x  dy  a  —  b  cos  y  ’ 


sm  x  = 
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and  deduce  that  if  0  <  x  <  n  then 


dx 


1 


a  +  b  cos  x  y/a2  -  b 2 


arc  cos 


a  cos  x  +  b 
a  +  b  cos  x 


Show  that  this  result  agrees  with  that  of  Ex.  3. 

5.  Show  how  to  integrate  1/ (a  +  b  cos  x  +  c  sin  x) .  [Express  b  cos  x  +  c  sin  x 
in  the  form  \Jb2  +  c2  cos(x  —  «).] 

6.  Integrate  (a  +  bcosx  +  csinx)/(a  +  (3 cosx  +  7 sin x). 

[Determine  A,  //,  v  so  that 


a  +  b  cos  x  +  c  sin  x  =  \  +  /x(a  +  f3  cos  x  +  7  sin  x)  +  v(—j3  sin  x  +  7  cos  x) . 


Then  the  integral  is 


\ix  +  v  log  | a  +  /3  cos  x  +  7  sin  x\  +  A 


dx 

a  +  (3  cos  x  +  7  sin  x 


7.  Integrate  l/(a  cos2  x+2b  cosx  sin  x+c  sin2  x).  [The  subject  of  integration 
may  be  expressed  in  the  form  1  /(A  +  B  cos  2 x  +  C  sin  2x) ,  where  A  =  |  (a  +  c) , 
B  =  (a  —  c),  C  =  b:  but  the  integral  may  be  calculated  more  simply  by  putting 
tan  x  =  t,  when  we  obtain 


sec2  x dx 


dt 


a  +  2b  tan  x  +  c  tan2  x 


a  +  2  bt  +  ct 2 


144.  Integrals  involving  arc  sin  x,  arc  tan  x,  and  logic.  The  inte¬ 
grals  of  the  inverse  sine  and  tangent  and  of  the  logarithm  can  easily  be 
calculated  by  integration  by  parts.  Thus 


arc  sin  x  dx  —  x  arc  sin  x  — 
arc  tan  xdx  =  x  arc  tan  x  - 


x  dx 


x  dx 


x 


=  x  arc  sin  x  +  \/l  —  x2, 


J  1  +  x2 

log  x  dx  =  x  log  x  —  [  dx  =  x( logic  — 1). 


=  x  arc  tan  x  —  |  log(l  +  x2), 
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It  is  easy  to  see  that  if  we  can  find  the  integral  of  y  —  f(x)  then  we  can 
always  find  that  of  x  =  4>(y),  where  (j)  is  the  function  inverse  to  /.  For  on 
making  the  substitution  y  =  f(x)  we  obtain 


<j>(y)  dy=  xf'(x)  dx  =  xf(x) 


f(x)  dx. 


The  reader  should  evaluate  the  integrals  of  arc  sin  y  and  arc  tan  y  in  this 
way. 

Integrals  of  the  form 


J  P(x,  arc  sin  x)  dx , 


/pfck**)*, 


where  P  is  a  polynomial,  can  always  be  calculated.  Take  the  first 
form,  for  example.  We  have  to  calculate  a  number  of  integrals  of  the 

type  /  £m(arcsin  x)n  dx.  Making  the  substitution  x  =  siny,  we  obtain 


yn  sinm  y  cos  y  dy,  which  can  be  found  by  the  method  of  §  142.  In  the 
case  of  the  second  form  we  have  to  calculate  a  number  of  integrals  of  the 
type  /  xm(log x)n  dx.  Integrating  by  parts  we  obtain 


[  xm(log  x)ndx  =  ,r"l+1(IOg3:)’> - —  [  xm(\ogx)n~1dx, 

J  m  + 1  m  + 1  J 

and  it  is  evident  that  by  repeating  this  process  often  enough  we  shall  always 
arrive  finally  at  the  complete  value  of  the  integral. 


145.  Areas  of  plane  curves.  One  of  the  most  important  appli¬ 
cations  of  the  processes  of  integration  which  have  been  explained  in  the 
preceding  sections  is  to  the  calculation  of  areas  of  plane  curves.  Suppose 
that  PqPP'  (Fig.  44)  is  the  graph  of  a  continuous  curve  y  =  <p(x)  which 
lies  wholly  above  the  axis  of  x,  P  being  the  point  (x,  y)  and  P'  the  point 
(. x+h ,  y  +  k),  and  h  being  either  positive  or  negative  (positive  in  the  figure). 
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Fig.  44a. 


The  reader  is  of  course  familiar  with  the  idea  of  an  ‘area’,  and  in  par¬ 
ticular  with  that  of  an  area  such  as  ON  PP0.  This  idea  we  shall  at  present 
take  for  granted.  It  is  indeed  one  which  needs  and  has  received  the  most 
careful  mathematical  analysis:  later  on  we  shall  return  to  it  and  explain 
precisely  what  is  meant  by  ascribing  an  ‘area’  to  such  a  region  of  space 
as  ONPPq.  For  the  present  we  shall  simply  assume  that  any  such  region 
has  associated  with  it  a  definite  positive  number  ( ONPPq )  which  we  call 
its  area,  and  that  these  areas  possess  the  obvious  properties  indicated  by 
common  sense,  e.g.  that 

(. PRP ')  +  ( NN'RP )  =  (NN'P'P),  (N\NPP] )  <  ( ONPPq ), 
and  so  on. 

Taking  all  this  for  granted  it  is  obvious  that  the  area  ONPPq  is  a 
function  of  x ;  we  denote  it  by  <F(a;).  Also  <F(a;)  is  a  continuous  function. 
For 

®(x  +  h)  -  $(x)  =  ( NN'P'P ) 

=  (NN'RP)  +  (PRP')  =  h(j)(x)  +  (PRP'). 

As  the  figure  is  drawn,  the  area  PRP'  is  less  than  hk.  This  is  not 
however  necessarily  true  in  general,  because  it  is  not  necessarily  the  case 
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(see  for  example  Fig.  44a)  that  the  arc  PP'  should  rise  or  fall  steadily  from 
P  to  P' .  But  the  area  PRP'  is  always  less  than  \h\X(h),  where  A (h)  is  the 
greatest  distance  of  any  point  of  the  arc  PP'  from  PR.  Moreover,  since 
4>(x)  is  a  continuous  function,  X(h)  — *  0  as  h  — >  0.  Thus  we  have 

<h(x  +  h)  —  <h(x)  =  h{4>(x)  +  ii(h)}, 

where  \/a(h)\  <  X (h)  and  X(h)  — >  0  as  h  — >  0.  From  this  it  follows  at  once 
that  <F(x)  is  continuous.  Moreover 

$'(x)  =  lim  +  ^  -  $('r)  =  lim{0(x)  +  fi(h)}  =  <j>(x). 

h—>0  h  h-tO 

Thus  the  ordinate  of  the  curve  is  the  derivative  of  the  area,  and  the  area  is 
the  integral  of  the  ordinate. 

We  are  thus  able  to  formulate  a  rule  for  determining  the  area  ONPP0. 
Calculate  <F(x),  the  integral  of  4>{x).  This  involves  an  arbitrary  constant, 
which  we  suppose  so  chosen  that  <h(0)  =  0.  Then  the  area  required  is  <h(x). 

If  it  were  the  area  N\NPP\  which  was  wanted,  we  should  of  course  determine 
the  constant  so  that  4>(xi)  =  0,  where  x\  is  the  abscissa  of  Pi.  If  the  curve  lay 
below  the  axis  of  x,  4>(x)  would  be  negative,  and  the  area  would  be  the  absolute 
value  of  4>(a;). 

146.  Lengths  of  plane  curves.  The  notion  of  the  length  of  a  curve, 
other  than  a  straight  line,  is  in  reality  a  more  difficult  one  even  than  that 
of  an  area.  In  fact  the  assumption  that  PqP  (Fig.  44)  has  a  definite  length, 
which  we  may  denote  by  S(x),  does  not  suffice  for  our  purposes,  as  did  the 
corresponding  assumption  about  areas.  We  cannot  even  prove  that  S(x)  is 
continuous,  i.e.  that  lim{S'(P/)  —  S(P)}  =  0.  This  looks  obvious  enough 
in  the  larger  figure,  but  less  so  in  such  a  case  as  is  shown  in  the  smaller 
figure.  Indeed  it  is  not  possible  to  proceed  further,  with  any  degree  of 
rigour,  without  a  careful  analysis  of  precisely  what  is  meant  by  the  length 
of  a  curve. 

It  is  however  easy  to  see  what  the  formula  must  be.  Let  us  suppose 
that  the  curve  has  a  tangent  whose  direction  varies  continuously,  so  that 
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(f>'{x)  is  continuous.  Then  the  assumption  that  the  curve  has  a  length  leads 
to  the  equation 

{S(x  +  h)~  S(x)}/h  =  {PP'}/h  =  ( PP'/h )  x  ({PP'}/PP')} 
where  {PP1}  is  the  arc  whose  chord  is  PP'.  Now 

/  P2 

PP'  +  VPR 2  +  RP'2  =  hJ  1  +  — , 

and 

k  =  4>(x  +  h )  —  4>(x)  =  h(f)\£), 
where  £  lies  between  x  and  x  +  h.  Hence 

lim(PP/ /h)  =  lim  a/1  +  [</>'(£)]2  =  \/l  +  [4>'(x)]2. 

If  also  we  assume  that 

lim  {PP'}/PP'  =  1, 

we  obtain  the  result 

S\x)  =  lim{5'(x  +  h)  —  S(x)}/h  =  a/1  +  [(p'(x)}2 

and  so 

S(x)  —  j  a/1  +  [( t>'(x )]2  dx. 

Examples  LIV.  1.  Calculate  the  area  of  the  segment  cut  off  from  the 
parabola  y  =  x2/4 a  by  the  ordinate  x  =  £,  and  the  length  of  the  arc  which 
bounds  it. 

2.  Answer  the  same  questions  for  the  curve  ay 2  =  x3,  showing  that  the 
length  of  the  arc  is 


3.  Calculate  the  areas  and  lengths  of  the  circles  x2  +  y2  =  a2,  x2  +  y2  =  2 ax 
by  means  of  the  formulae  of  §§  145-146. 
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4.  Show  that  the  area  of  the  ellipse  (x2/a2)  +  (y2/b2)  =  1  is  irab. 

5.  Find  the  area  bounded  by  the  curve  y  =  sin  x  and  the  segment  of  the 
axis  of  x  from  x  =  0  to  x  =  2ir.  [Here  <b(x)  =  —  cosx,  and  the  difference  between 
the  values  of  —  cosx  for  x  =  0  and  x  =  2ir  is  zero.  The  explanation  of  this  is 
of  course  that  between  x  =  ir  and  x  =  2ir  the  curve  lies  below  the  axis  of  x, 
and  so  the  corresponding  part  of  the  area  is  counted  negative  in  applying  the 
method.  The  area  from  x  =  0  to  x  =  n  is  —  cos7r  +  cosO  =  2;  and  the  whole 
area  required,  when  every  part  is  counted  positive,  is  twice  this,  i.e.  is  4.] 

6.  Suppose  that  the  coordinates  of  any  point  on  a  curve  are  expressed  as 
functions  of  a  parameter  t  by  equations  of  the  type  x  =  4>(t),  y  =  ip(t),  0  and  ^ 
being  functions  of  t  with  continuous  derivatives.  Prove  that  if  x  steadily  increases 
as  t  varies  from  to  to  t±,  then  the  area  of  the  region  bounded  by  the  corresponding 
portion  of  the  curve,  the  axis  of  x,  and  the  two  ordinates  corresponding  to  to 
and  fi,  is,  apart  from  sign,  A(H)  —  A(to),  where 

A{t)  =  J  il>{t)4>  (t)  dt  =  J  y  ^  dt. 

7.  Suppose  that  C  is  a  closed  curve  formed  of  a  single  loop  and  not  met 
by  any  parallel  to  either  axis  in  more  than  two  points.  And  suppose  that  the 
coordinates  of  any  point  P  on  the  curve  can  be  expressed  as  in  Ex.  6  in  terms 
of  t.  and  that,  as  t  varies  from  to  to  t\,  P  moves  in  the  same  direction  round  the 
curve  and  returns  after  a  single  circuit  to  its  original  position.  Show  that  the 
area  of  the  loop  is  equal  to  the  difference  of  the  initial  and  final  values  of  any 
one  of  the  integrals 


-JyTtdt’  Ix‘^dt’  5 


dy  dx\  , 

V  -  ^ ' dt ■ 


this  difference  being  of  course  taken  positively. 

8.  Apply  the  result  of  Ex.  7  to  determine  the  areas  of  the  curves  given  by 


x  1  —  tz 
(,)  ;  = 


y 

a 


2 1 


1  +  t2’ 


(ii)  x  =  acos3f,  y  =  6  sin3 1. 


9.  Find  the  area  of  the  loop  of  the  curve  x3  +  y3  =  3 axy.  [Putting  y  =  tx 
we  obtain  x  =  3 at/ (1  + 13),  y  =  3 at2 j (1  + 13).  As  t  varies  from  0  towards  oo  the 
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loop  is  described  once.  Also 


1 

2 


dx  dy  \  ^ 

v— - x—  at 

y  dt  dt 


1 

2 


x2-  (V) 
dt\x) 


dt 


1  I"  9  a2t2  3a2 

2  J  (Tm 3)2  =  2(1  + *3) 


which  tends  to  0  as  t  oo.  Thus  the  area  of  the  loop  is  §a2.] 

10.  Find  the  area  of  the  loop  of  the  curve  x5  +  yr>  =  5ax2y2. 

11.  Prove  that  the  area  of  a  loop  of  the  curve  x  =  asin2f,  y  =  a  sin  t  is  |a2. 

(Math.  Trip.  1908.) 

12.  The  arc  of  the  ellipse  given  by  x  =  a  cost,  y  =  6sint,  between  the  points 
t  =  t\  and  t  =  t2,  is  Ffo)  —  F(ti),  where 


F(t)  =  a  j  \/l  —  e2  sin2  t  dt, 

e  being  the  eccentricity.  [This  integral  cannot  however  be  evaluated  in  terms  of 
such  functions  as  are  at  present  at  our  disposal.] 

13.  Polar  coordinates.  Show  that  the  area  bounded  by  the  curve  r  =  f(0), 
where  f(6)  is  a  one- valued  function  of  0,  and  the  radii  0  =  B\,  9  =  O2,  is 

F(6 2)  —  F(9\),  where  F(9)  =  h  J 7’2  d9.  And  the  length  of  the  corresponding 
arc  of  the  curve  is  <h(02)  —  $(0i),  where 


d9. 


Hence  determine  (i)  the  area  and  perimeter  of  the  circle  r  =  2a  sin  0;  (ii)  the 
area  between  the  parabola  r  =  sec2  \9  and  its  latus  rectum,  and  the  length  of 
the  corresponding  arc  of  the  parabola;  (iii)  the  area  of  the  limagon  r  =  a+6cos  9, 
distinguishing  the  cases  in  which  a  >  b,  a  =  b,  and  a  <  6;  and  (iv)  the  areas  of 
the  ellipses  1  / r 2  =  a  cos2  9  +  2h  cos  9  sin  9  +  b  sin2  9  and  l/r  =  1  +  e  cos  9.  [In  the 

/dQ 

7 - --77,  which  may  be  calculated 

(1  +  ecosflR ' 

(cf.  Ex.  Lin.  4)  by  the  help  of  the  substitution 

(1  +  ecos0)(l  —  ecos<p)  =  1  —  e2.] 


14.  Trace  the  curve  29  =  (a/r)  +  (r/ a),  and  show  that  the  area  bounded  by 
the  radius  vector  9  =  [3,  and  the  two  branches  which  touch  at  the  point  r  =  a, 
9  =  1,  is  §a2(/32  —  l)3/2.  (Math.  Trip.  1900.) 
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15.  A  curve  is  given  by  an  equation  p  =  f(r),  r  being  the  radius  vector  and 
p  the  perpendicular  from  the  origin  on  to  the  tangent.  Show  that  the  calculation 
of  the  area  of  the  region  bounded  by  an  arc  of  the  curve  and  two  radii  vectores 

depends  upon  that  of  the  integral  \ 

MISCELLANEOUS  EXAMPLES  ON  CHAPTER  VI. 

1.  A  function  f(x)  is  defined  as  being  equal  to  1  +  x  when  x  ^  0,  to  x  when 

0  <  x  <  1,  to  2  —  x  when  1  ^  x  ^  2,  and  to  3x  —  x2  when  x  >  2.  Discuss  the 
continuity  of  f(x)  and  the  existence  and  continuity  of  f(x)  for  x  =  0,  x  =  1, 
and  x  =  2.  (Math.  Trip.  1908.) 

2.  Denoting  a,  ax  +  b,  ax2  +  2 bx  +  c,  . . .  by  uq,  u\,  112,  ■  ■  ■ ,  show  that 
UqU3  —  3uqU\U2  +  2 u\  and  uqu^  —  Auiu^  +  3u2  are  independent  of  x. 

3.  If  ao,  aq,  •  •  • ,  a2n  are  constants  and  Ur  =  (ao,  a\, . . . ,  ar  jj  x,  l)r,  then 

U0U211  —  2nlhU2n-l  +  ^ — U2U2n-2  —  •  •  •  +  U211U0 

is  independent  of  x.  (Math.  Trip.  1896.) 

[Differentiate  and  use  the  relation  U'r  =  rUr-\.\ 

4.  The  first  three  derivatives  of  the  function  arc  sin(/i  sin  x)  —  x,  where 
/i  >  1,  are  positive  when  0  ^  x  ^  \tt. 

5.  The  constituents  of  a  determinant  are  functions  of  x.  Show  that  its 
differential  coefficient  is  the  sum  of  the  determinants  formed  by  differentiating 
the  constituents  of  one  row  only,  leaving  the  rest  unaltered. 

6.  If  /i,  /2,  fy,  R  are  polynomials  of  degree  not  greater  than  4,  then 

fl  f'2  f'3  A 

fi  f^,  n  n 

fir  rll  fU  fll 

1 1  /  2  /  3  J  4 

fin  fin  fill  fin 

J 1  J  2  J  3  J  4 

is  also  a  polynomial  of  degree  not  greater  than  4.  [Differentiate  five  times,  using 
the  result  of  Ex.  5,  and  rejecting  vanishing  determinants.] 

7.  If  y 3  +  3 yx  +  2x3  =  0  then  x2(\  +  x3)y"  —  | xy'  +  y  =  0. 

(Math.  Trip.  1903.) 
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8.  Verify  that  the  differential  equation  y  =  (j>{if(yi)}  +  <f>{x  —  f>(yi)},  where 
yi  is  the  derivative  of  y,  and  if  is  the  function  inverse  to  <//,  is  satisfied  by 
y  =  4>{c)  +  4>(x  —  c)  or  by  y  =  2< f>(\x). 

9.  Verify  that  the  differential  equation  y  =  {x/V’(yi)}</>{V’(2/i)})  where  the 
notation  is  the  same  as  that  of  Ex.  8,  is  satisfied  by  y  =  c<f{x/c)  or  by  y  =  f3x, 
where  /3  =  f(a)/a  and  a  is  any  root  of  the  equation  <j)(a )  —  a<j)'{a)  =  0. 

10.  If  ax  +  by  +  c  =  0  then  y2  =  0  (suffixes  denoting  differentiations  with  re¬ 
spect  to  x) .  We  may  express  this  by  saying  that  the  general  differential  equation 
of  all  straight  lines  is  y2  =  0.  Find  the  general  differential  equations  of  (i)  all 
circles  with  their  centres  on  the  axis  of  x,  (ii)  all  parabolas  with  their  axes  along 
the  axis  of  x,  (iii)  all  parabolas  with  their  axes  parallel  to  the  axis  of  y,  (iv)  all 
circles,  (v)  all  parabolas,  (vi)  all  conics. 

[The  equations  are  (i)  1  +  y\  +  yy2  =  0,  (ii)  y\  +  yy2  =  0,  (iii)  y3  =  0, 
(iv)  (1  +  yf)y3  =  32/i2/|,  (v)  =  32/22/4,  (vi)  9 y\y3  -  45y2ym  +  40 y\  =  0. 

In  each  case  we  have  only  to  write  down  the  general  equation  of  the  curves  in 
question,  and  differentiate  until  we  have  enough  equations  to  eliminate  all  the 
arbitrary  constants.] 

11.  Show  that  the  general  differential  equations  of  all  parabolas  and  of  all 
conics  are  respectively 

Dl(yf2/ 3)  =  0,  D3x(y- 2/3)  =  0. 

[The  equation  of  a  conic  may  be  put  in  the  form 

y  =  ax  +  b  ±  \J px2  +  2 qx  +  r. 


From  this  we  deduce 


2/2 


=  ±(pr  —  q2)/(px 2  +  2  qx  +  r)3^2. 


If  the  conic  is  a  parabola  then  p  =  0.] 

^  dy  1  d2y  1  d3y  1  dAy 

12 '  Den0tmg  Wr  2!  3!  4!  Jj'  "  by  *'  c' 

1  t  1  t  1  d ^  t 

2!^'3!d?'4!  V""  byT’“’  ^  7’ '  ■ '  ’  Sh°W  “10‘ 


dx 

and  — , 
dy 


4ac  —  5  b2  =  (4a7  —  5/32)/r8,  bt  —  az  =  —{fir  —  a2)/rG 
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Establish  similar  formulae  for  the  functions  a2d  —  3abc  —  2 63,  (1  +  t2)b  —  2 a2t, 
2  ct  —  5ab. 

13.  Prove  that,  if  yk  is  the  /ctli  derivative  of  y  =  sin(n  arc  sin  x),  then 

(1  -  x2)yk+ 2  -  (2 k  +  l)xyk+i  +  ( n 2  -  k2)yk  =  0. 

[Prove  first  when  k  =  0,  and  differentiate  k  times  by  Leibniz’  Theorem.] 

14.  Prove  the  formula 

vDnxu  =  D™(uv)  -  nD'i~\uDxv)  +  Dnx~2  (uD2xv)  -... 

where  n  is  any  positive  integer.  [Use  the  method  of  induction.] 

15.  A  curve  is  given  by 

x  =  a(2cosf  +  cos2f),  y  =  a(2  sin  t  —  sin  2t). 

Prove  (i)  that  the  equations  of  the  tangent  and  normal,  at  the  point  P  whose 
parameter  is  t,  are 

x  sin  1 1  +  y  cos  =  a  sin  1 1,  xcos  \t  —  y  sin  1 1  =  3a  cos  §t; 

(ii)  that  the  tangent  at  P  meets  the  curve  in  the  points  Q,  R  whose  parameters 
are  —\t,  and  7 r  —  (iii)  that  QR  =  4a;  (iv)  that  the  tangents  at  Q  and  R  are 
at  right  angles  and  intersect  on  the  circle  x2  +  y2  =  a2;  (v)  that  the  normals  at 
P ,  Q ,  and  R  are  concurrent  and  intersect  on  the  circle  x2  +  y2  =  9a2;  (vi)  that 
the  equation  of  the  curve  is 

(x2  +  y2  +  12  ax  +  9a2)2  =  4a(2.x  +  3a)3. 

Sketch  the  form  of  the  curve. 

16.  Show  that  the  equations  which  define  the  curve  of  Ex.  15  may  be  replaced 
by  £/a  =  2a  +  (l/it2),  77/a  =  (2/a)  +  a2,  where  £  =  x  +  yi,  rj  =  x  —  yi,  a  =  Cist. 
Show  that  the  tangent  and  normal,  at  the  point  defined  by  a,  are 

a2£  —  uy  =  a(u 3  —  1),  a2£  +  uy  =  3a(a3  +  1), 

and  deduce  the  properties  (ii)-(v)  of  Ex.  15. 

17.  Show  that  the  condition  that  x4  +  4 px3  —  4 qx  —  1  =  0  should  have  equal 
roots  may  be  expressed  in  the  form  (p  +  q )2/3  —  (p  —  q )2^3  =  1. 

(Math.  Trip.  1898.) 
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18.  The  roots  of  a  cubic  f(x)  =  0  are  a,  /3,  7  in  ascending  order  of  magnitude. 

Show  that  if  [a,  /3]  and  [/3, 7]  are  each  divided  into  six  equal  sub-intervals,  then 
a  root  of  ,f  (x)  =  0  will  fall  in  the  fourth  interval  from  /3  on  each  side.  What 
will  be  the  nature  of  the  cubic  in  the  two  cases  when  a  root  of  f'(x )  =  0  falls  at 
a  point  of  division?  (Math.  Trip.  1907.) 

19.  Investigate  the  maxima  and  minima  of  f(x),  and  the  real  roots  of 
/( x)  =  0,  f(x)  being  either  of  the  functions 

x  —  sinx  —  tana(l  —  cosx),  x  —  shirr  —  (a  —  sina)  —  tan  ^(cosa  —  cosx), 


and  a  an  angle  between  0  and  n.  Show  that  in  the  first  case  the  condition  for  a 
double  root  is  that  tana  —  a  should  be  a  multiple  of  -k. 

20.  Show  that  by  choice  of  the  ratio  A  :  y  we  can  make  the  roots  of 
A(ax2  +  bx  +  c)  +  y(a'x 2  +  b'x  +  c')  =  0  real  and  having  a  difference  of  any 
magnitude,  unless  the  roots  of  the  two  quadratics  are  all  real  and  interlace; 
and  that  in  the  excepted  case  the  roots  are  always  real,  but  there  is  a  lower 
limit  for  the  magnitude  of  their  difference.  (Math.  Trip.  1895.) 

[Consider  the  form  of  the  graph  of  the  function  (ax2 +bx+c)  /  (a' x2  +  b' x+d)\ 
cf.  Exs.  XLVI.  12  et  seq .] 


21.  Prove  that 


sin  ttx 
x(l  —  x) 


<  4 


when  0  <  x  <  1,  and  draw  the  graph  of  the  function. 

22.  Draw  the  graph  of  the  function 


7 r  cot  nx 


1 

X 


1 

x  —  1  ' 


23.  Sketch  the  general  form  of  the  graph  of  y ,  given  that 


dy  (Qx2  +  x  —  l)(x  —  \)2(x  +  l)3 
dx  x2 


(Math.  Trip.  1908.) 

24.  A  sheet  of  paper  is  folded  over  so  that  one  corner  just  reaches  the  opposite 
side.  Show  how  the  paper  must  be  folded  to  make  the  length  of  the  crease  a 
maximum. 

25.  The  greatest  acute  angle  at  which  the  ellipse  (x2/a2)  +  ( y2/b 2)  =  1  can 
be  cut  by  a  concentric  circle  is  arctan{(a2  —  b2)/2ab}.  (Math.  Trip.  1900.) 
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26.  In  a  triangle  the  area  A  and  the  semi-perimeter  s  are  fixed.  Show 
that  any  maximum  or  minimum  of  one  of  the  sides  is  a  root  of  the  equation 
s(x  —  s)x2  +  4A2  =  0.  Discuss  the  reality  of  the  roots  of  this  equation,  and 
whether  they  correspond  to  maxima  or  minima. 

[The  equations  a  +  b  +  c  =  2s,  s(s  —  a)(s  —  b)(s  —  c)  =  A2  determine  a  and  b 
as  functions  of  c.  Differentiate  with  respect  to  c,  and  suppose  that  da/dc  =  0. 
It  will  be  found  that  b  =  c,  s  —  b  =  s  —  c  =  ^ a ,  from  which  we  deduce  that 
s(a  —  s)a2  +  4A2  =  0. 

This  equation  has  three  real  roots  if  s4  >  27A2,  and  one  in  the  contrary 
case.  In  an  equilateral  triangle  (the  triangle  of  minimum  perimeter  for  a  given 
area)  s4  =  27A2;  thus  it  is  impossible  that  s4  <  27A2.  Hence  the  equation  in  a 
has  three  real  roots,  and,  since  their  sum  is  positive  and  their  product  negative, 
two  roots  are  positive  and  the  third  negative.  Of  the  two  positive  roots  one 
corresponds  to  a  maximum  and  one  to  a  minimum.] 


27.  The  area  of  the  greatest  equilateral  triangle  which  can  be  drawn  with 
its  sides  passing  through  three  given  points  A,  B,  C  is 


2A  + 


+  b2  +  , 
2^/3 


a,  b,  c  being  the  sides  and  A  the  area  of  ABC.  (Math.  Trip.  1899.) 

28.  If  A,  A'  are  the  areas  of  the  two  maximum  isosceles  triangles  which 

can  be  described  with  their  vertices  at  the  origin  and  their  base  angles  on  the 
cardioid  r  =  a(l  +  cos$),  then  256AA'  =  25a4 a/5.  (Math.  Trip.  1907.) 

29.  Find  the  limiting  values  which  (x2  —  4y  +  8)/ ( y 2  —  6x  +  3)  approaches  as 

the  point  ( x ,  y)  on  the  curve  x2y  —  4x2  —  4 xy  +  y2  +  16x  —  2y  —  7  =  0  approaches 
the  position  (2,3).  (Math.  Trip.  1903.) 

[If  we  take  (2, 3)  as  a  new  origin,  the  equation  of  the  curve  becomes 
£277  —  £ 2  +  rj2  =  0,  and  the  function  given  becomes  (£2  +  4£  —  4r/) / (rf2  +  6?y  —  6£) . 
If  we  put  rj  =  t£,  we  obtain  £  =  (1  —  t2)/t,  rj  =  1  —  t2.  The  curve  has  a  loop 
branching  at  the  origin,  which  corresponds  to  the  two  values  t  =  —  1  and  t  =  1. 
Expressing  the  given  function  in  terms  of  t,  and  making  t  tend  to  —1  or  1,  we 
obtain  the  limiting  values  —  §,  —  |.] 

30.  If  f(x)  = - 7 - r - ,  then 

sin  x  —  sm  a  (x  —  a)  cos  a 


—  {lim  f(x)}  —  lim  f'(x)  =  |  sec3  a  —  A  sec  a. 
da  x^a  x^ra  ^  ±z 


(Math.  Trip.  1896.) 
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31.  Show  that  if  < j>(x)  =  1/(1  +  x 2)  then  4>n(x)  =  Qn(x)/(  1  +  x2)n+1,  where 
Qn(x )  is  a  polynomial  of  degree  n.  Show  also  that 

(i)  Qn+i  =  (1  +  x2)Q'n  -  2(n  +  l)xQn, 

(ii)  Qn+ 2  +  2  (n  +  2)xQn+i  +  (n  +  2)(n  +  1)(1  +  x2)Qn  =  0, 

(iii)  (1  +  x2)Q"%  —  2  nxQ'n  +  n(n  +  1  )Qn  =  0, 

(iv)  Qn  =  (-1  )nn\  |(n  +  l)xn  -  (n  +  +  . . .  j, 

(v)  all  the  roots  of  Qn  =  0  are  real  and  separated  by  those  of  Qn-i  =  0. 

32.  If  f(x),  4>(x),  have  derivatives  when  a  ^  x  S  b,  then  there  is  a 

value  of  £  lying  between  a  and  b  and  such  that 


/(a)  (j>{a)  i jj{a) 

f(b)  <f>(b)  tp(b) 

no  no  no 


=  0. 


[Consider  the  function  formed  by  replacing  the  constituents  of  the  third  row 
by  f(x),  4>(x),  'ip(x).  This  theorem  reduces  to  the  Mean  Value  Theorem  (§  125) 
when  4>(x)  =  x  and  ip(x)  =  1.] 

33.  Deduce  from  Ex.  32  the  formula 

m  -  f(a)  no 

<t>{b)  -<t>(a)  no’ 

34.  If  4>'(x)  ->  a  as  x  — >  oo,  then  cj>(x)/x  — >  a.  If  4>'(x)  oo  then  <f>(x)  — >•  oo. 
[Use  the  formula  <j>(x)  —  4>(xq)  =  (x  —  xo)(fJ(Oi  where  xo  <  £  <  x.] 

35.  If  c A(x )  — >  a  as  x  — >  oo,  then  c ft(x )  cannot  tend  to  any  limit  other  than 
zero. 

36.  If  4>(x)  +  o’ (x)  — >  a  as  x  — >  oo,  then  cj>(x)  — >  a  and  <f>'(x)  — >  0. 

[Let  (j>{x)  =  a  +  ift(x),  so  that  ^(x)  +  ip'(x)  — >  0.  If  ip'(x)  is  of  constant  sign, 
say  positive,  for  all  sufficiently  large  values  of  x,  then  iJj(x)  steadily  increases 
and  must  tend  to  a  limit  l  or  to  oo.  If  ^(x)  — >  oo  then  ip'(x)  — >  — oo,  which 
contradicts  our  hypothesis.  If  ip(x)  l  then  ip'(x)  — >  —l,  and  this  is  impossible 
(Ex.  35)  unless  l  =  0.  Similarly  we  may  dispose  of  the  case  in  which  tp'(x)  is 
ultimately  negative.  If  ip{x)  changes  sign  for  values  of  x  which  surpass  all  limit, 
then  these  are  the  maxima  and  minima  of  il>(x).  If  x  has  a  large  value  corre¬ 
sponding  to  a  maximum  or  minimum  of  x),  then  ip(x)  +  i/j'(x)  is  small  and 
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ip'(x)  =  0,  so  that  if(x)  is  small.  A  fortiori  are  the  other  values  of  if(x)  small 
when  x  is  large. 


For  generalisations  of  this  theorem,  and  alternative  lines  of  proof,  see  a  paper 
by  the  author  entitled  “Generalisations  of  a  limit  theorem  of  Mr  Mercer,”  in 
volume  43  of  the  Quarterly  Journal  of  Mathematics.  The  simple  proof  sketched 
above  was  suggested  by  Prof.  E.  W.  Hobson.] 


37.  Show  how  to  reduce  /  R  ■ 


ax  +  b  cx  +  d 


mx  +  n  '  V  mx  +  n 


dx  to  the  integral 


of  a  rational  function.  [Put  mx  +  n  =  1/t  and  use  Ex.  XLIX.  13.] 


38.  Calculate  the  integrals: 


dx 


(1  +  x2)3  : 


x  —  1  dx 


x  dx 


X  +  l  x’  J  y/l  +  x  -  y/l  +  x’ 
f  5  cos  x  +  6 


dx 


I  2  cos  x  +  sin  x  +  3 
cos  x  sin  x  dx 


dx , 

cosec  x  y/  sec  2x  dx , 


(2  —  sin2  x)(2  +  sinx  —  sin2  x)  ’  J  cos4x  +  sin4x’  j 

dx  f  x  +  sin  x  ,  f  ,  f  .2 

— ,  - .  /  - dx,  /  arcsecxax,  /  (arcsmx)  ax, 

v/(l  +  sinx)(2  +  sinx)  J  1  +  cosx  J  J 


/  x arcsmx 

x  arcsmx  ax,  /  —  ax, 


arc  sin  x 


y/l  —  x2 


dx, 


arc  sin  x 


(1  +  x)5 


dx, 


arc  tan  x 


dx, 


arc  tan  x 


(1  +  x2)3/2 


dx, 


x 

log(a2  +  /32x2)  I  log  (a  +  fix)  ^ 


(a  +  bx)2 


39.  Formulae  of  reduction,  (i)  Show  that 


2(„  -  1)(,  -  1/)  j  (x2+^  +  q}n 

=  ,  2  X-Al-  +  (2n  —  3)  /  ^  , 

(x2 +px  +  (/)n_i  J  (x2+px  +  q)n~1 
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[Put  x  +  \p  =  t,  q  —  1  p2  =  A:  then  we  obtain 


dt 


1 


dt 


1 


t 2  dt 


(t2  +  A)n  A  J  (t2  +  A)n_1  A  J  (t2  +  A) 
If  dt  +  1  f 


A  J  (t2  +  X)11-1  2A(n  -  1) 


ldt  {  (f2  +  A)”"1 } 


and  the  result  follows  on  integrating  by  parts. 

A  formula  such  as  this  is  called  a  formida  of  reduction.  It  is  most  useful 

r 

when  n  is  a  positive  integer.  We  can  then  express  /  — -r - in  terms  of 

J  ( x2+px  +  q)n 

r  djf 

/  — — 2 - -  1 ,  and  so  evaluate  the  integral  for  every  value  of  n  in  turn.] 

J  (x  T  px  +  Q)'1 

(ii)  Show  that  if  Ip.q  =  f  xp(l  +  x)q  dx  then 


(p  +  1  )IPtg  =  xp+1{l  +  x)q  -  qlp+i,q- 1, 

and  obtain  a  similar  formula  connecting  Ip,q  with  /p_ ii9+i.  Show  also,  by  means 
of  the  substitution  x  =  —y/(  1  +  y),  that 

Ip,q  =  (-l)p+1  J  yp(l  +  y)-p-q-2dy. 


(iii)  Show  that  if  X  =  a  +  bx  then 


J  xX~1/3dx 
J  x2X~l/3dx 

J  xX-^dx 

J  x2X~1/*dx 


-3(3 a  -  2te)V2/3/1062, 

3(9a2  -  6 abx  +  5b2x2)X2/3 /40b3 , 

— 4(4o  —  3bx)X3^/21b2, 

4(32 a2  -  24a6x  +  21&V)V3/4/23163. 


(iv)  If  7m,n  = 


xm  dx 
(1  +  x2)n 


then 


2 (n  -  1  )Im,n  =  ~Xm  X(1  +  X2)  (n  +  (m,  -  l)/m_ 2,n— 1- 
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(v)  If  In  =  J  x"  cos  fix  dx  and  Jn  =  j  xn  sin  fix  dx  then 

filn  =  xn  sin  fix  —  nJn- 1,  fiJn  =  —  xn  cos  fix  +  n/n_i. 

(vi)  If  In  =  J  cos"  x  dx  and  Jn  =  j  sin"  x  dx  then 


nln  =  sin  x  cos"  1  x  +  (n  —  l)Zn,_2,  n  Jn  =  —  cos  a;  sin"  1  x  +  (n  —  1)  Jn-2- 

(vii)  If  In  =  j  tan n  xdx  then  (n  —  1  )(/n  +  In-2)  =  tan"-1  x. 

(viii)  If  Im>n  =  y  cosm  x  sin"  x  dx  then 

(m  +  n)Im,n  =  -  cosm+1  x  sin"-1  x  +  (n  -  l)/mjn_2 
=  cos’"-1  x  sin"+1  x  +  (m  —  l)/m_2,n. 

[We  have 

(m  +  1  )Im  n  =  ~  [  sin"-1  x— (cosm+1  x)  dx 
J  dx 

=  —  cosm+1  x  sin"-1  x  +  (n  —  1)  J  cosm+2  x  sin"-2  x  dx 
=  -  cosm+1  x  sin"-1  X  +  (n  -  l)(/m!n_2  - 

which  leads  to  the  first  reduction  formula.] 

(ix)  Connect  /m>n  =  J  sinm  x  sin  nx  dx  with  Im-2,n-  (Math.  Trip.  1897. 

(x)  If  Irn;n  =  J  xm  cosec"  x  dx  then 

(n  -  l)(n  -  2)/m>n  =  (n  -  2 )2/m,n-2  +  m(m  -  1  )/m_2i„_2 

—  x”t-1  cosec"-1  x{m  sin  x  +  (n  —  2)xcosx}. 


(Math.  Trip.  1896.) 
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(xi)  If  In  =  j  (a  +  b  cos  x)  n  dx  then 

(n  —  l)(a2  —  b2)In  =  —  6sinx(a  +  6cosx)_<'n~1^  +  (2  n  —  3)a/n_i  —  (n  —  2  )In- 

(xii)  If  In  =  J  (a  cos2  x  +  2  h  cos  x  sin  x  +  b  sin2  x)~n  dx  then 


4n(n  +  1  )(ab  —  h2)In+2  -  2n(2n  +  l)(a  +  b)In+ \  +  4 n2In  =  - 


d2In 


dx2  ' 

(Math.  Trip.  1898.) 


(xiii)  If  Im,n  =  /  xm(logx)n  dx  then 


(m  +  1  )Im,n  =  x  (logx)n  -  nIm)U- 1- 

40.  If  n  is  a  positive  integer  then  the  value  of  J  xm(logx)n  dx  is 


x 


m+ 1  f  (logx)n  n(logx)n  1  n(n—  l)(logx)n  2 
[  m+1  (m+  l)2  (m  +  l)3 


- + 


(-1  )nn\ 

(m  +  l)n+1 


41.  Show  that  the  most  general  function  4>(x),  such  that  q i"  +  a2cj)  =  0 
for  all  values  of  x,  may  be  expressed  in  either  of  the  forms  A  cos  ax  +  B  sin  ax, 
pcos(ax  +  e),  where  A,  B ,  p,  e  are  constants.  [Multiplying  by  2(f)'  and  integrating 
we  obtain  (j) '2  +  a2 <f>2  =  a2b2,  where  b  is  a  constant,  from  which  we  deduce  that 


42.  Determine  the  most  general  functions  y  and  z  such  that  y'  +  ujz  =  0, 
and  z'  —  uy  =  0,  where  a;  is  a  constant  and  dashes  denote  differentiation  with 
respect  to  x. 

43.  The  area  of  the  curve  given  by 

sin  a  sin  (j)  .  sin  a  cos  (j) 

x  =  cos0+- - - - r^— ,  y  =  sm0-- - - - , 

1  —  cosz  a  sm  <p  1  —  cosz  a  sin  <fi 


where  a  is  a  positive  acute  angle,  is  \tt(1  +  sin  a)2/ sin  a.  (Math.  Trip.  1904.) 
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44.  The  projection  of  a  chord  of  a  circle  of  radius  a  on  a  diameter  is  of 
constant  length  2a  cos /3;  show  that  the  locus  of  the  middle  point  of  the  chord 
consists  of  two  loops,  and  that  the  area  of  either  is  a2(/3  —  cos  (5  sin  fj). 

(Math.  Trip.  1903.) 

45.  Show  that  the  length  of  a  quadrant  of  the  curve  ( x/a )2/3  +  ( y/h )2//3  =  1 

is  (a2  +  ah  +  b2)/(a  +  b).  (Math.  Trip.  1911.) 

46.  A  point  A  is  inside  a  circle  of  radius  a,  at  a  distance  b  from  the  centre. 

Show  that  the  locus  of  the  foot  of  the  perpendicular  drawn  from  A  to  a  tangent 
to  the  circle  encloses  an  area  7r(a2  +  \b2).  (Math.  Trip.  1909.) 

47.  Prove  that  if  (a,  b ,  c,  f,g,h([  x,  y,  l)2  =  0  is  the  equation  of  a  conic,  then 

f  dx  PT 

J  (lx  +  my  +  n)  (hx  +  by  +  /)  °  PT ' 


where  PT,  PT'  are  the  perpendiculars  from  a  point  P  of  the  conic  on  the 
tangents  at  the  ends  of  the  chord  lx  +  my  +  n  =  0,  and  a ,  f3  are  constants. 

(Math.  Trip.  1902.) 

48.  Show  that 

f  ax 2  +  Tbx  +  c 

J  (Ax2  +  2 Bx  +  C )2  dX 

will  be  a  rational  function  of  x  if  and  only  if  one  or  other  of  AC  —  B2  and 
aC  +  cA  —  2 bB  is  zero.* 

49.  Show  that  the  necessary  and  sufficient  condition  that 


/(*) 


{F(x)}2 


dx , 


where  /  and  F  are  polynomials  of  which  the  latter  has  no  repeated  factor,  should 
be  a  rational  function  of  x ,  is  that  f'F'  —  fF"  should  be  divisible  by  F. 

(Math.  Trip.  1910.) 


50.  Show  that 


a  cos  x  +  f3  sin  x  +  7 
(1  —  ecosx)2 


dx 


is  a  rational  function  of  cosx  and  sinx  if  and  only  if  ae  +  7  =  0;  and  determine 
the  integral  when  this  condition  is  satisfied.  (Math.  Trip.  1910.) 


See  the  author’s  tract  quoted  on  p.  286. 


CHAPTER  VII 


ADDITIONAL  THEOREMS  IN  THE  DIFFERENTIAL  AND 
INTEGRAL  CALCULUS 

147.  Higher  Mean  Value  Theorems.  In  the  preceding  chapter 
(§  125)  we  proved  that  if  f{x)  has  a  derivative  f'(x)  throughout  the  interval 
[a,  b]  then 

/(6)-/(a)  =  (6-a)/'(0, 

where  a  <  £  <  6;  or  that,  if  f(x)  has  a  derivative  throughout  [a,  a  +  h], 
then 

f(a  +  h)~  f(a)  =  hf(a  +  0ih),  (1) 

where  0  <  9\  <  1.  This  we  proved  by  considering  the  function 

f(b)  ~  f(x)  ~  f(b)  ~  /(«)} 

U  LL 


which  vanishes  when  x  =  a  and  when  x  —  b. 

Let  us  now  suppose  that  f{x)  has  also  a  second  derivative  f"(x) 
throughout  [a,  b\,  an  assumption  which  of  course  involves  the  continuity  of 
the  first  derivative  f(x),  and  consider  the  function 


f{b )  ~  f(x)  -  (b-x)f(x) 


if(b)  ~  f(a )  -  (6 


a)f(a)}. 


This  function  also  vanishes  when  x  =  a  and  when  x  =  b;  and  its  derivative 

is 

a) 2  “  /(°)  -  (b  -  «)/'(«)  -  Ub  ~  a)2/"(^)}, 

and  this  must  vanish  (§  121)  for  some  value  of  x  between  a  and  b  (exclusive 
of  a  and  b).  Hence  there  is  a  value  £  of  x,  between  a  and  b ,  and  therefore 
capable  of  representation  in  the  form  a  +  92{b  —  a),  where  0  <  d2  <  1,  for 
which 

f(b)  =  f(a)  +  (b-a)f\a)  +  l(b-a)2f(0- 
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If  we  put  b  =  a  +  h  we  obtain  the  equation 

f(a+h)  —  f  (a)  +  hf'(a )  +  ^h2  f"(a  +  02h),  (2) 

which  is  the  standard  form  of  what  may  be  called  the  Mean  Value  Theorem 
of  the  second  order. 

The  analogy  suggested  by  (1)  and  (2)  at  once  leads  us  to  formulate  the 
following  theorem: 

Taylor’s  or  the  General  Mean  Value  Theorem.  If  f(x)  is  a 
function  of  x  which  has  derivatives  of  the  first  n  orders  throughout  the 
interval  [a,  b\ ,  then 

f(b)  =  /(«)  +  (b  —  a)f(a )  +  - — ^  f"(a)  +  •  •  • 

+  T“wr/i:”"1)(o)  + 

[n  —  1]!  n\ 

where  a  <  f  <  b;  and  if  b  =  a  +  h  then 

f(a  +  h)  —  f  (a)  +  hf(a)  +  |h2///(a)  +  . . . 

UTi-l  Un 

+  — — +  “t/  "  (°  + 

[n  —  lj!  n\ 

where  0  <  9n  <  1. 

The  proof  proceeds  on  precisely  the  same  lines  as  were  adopted  before 
in  the  special  cases  in  which  n  =  1  and  n  =  2.  We  consider  the  function 

Fn{x)  ~  (^)  Fn(a)’ 

where 


( b  —  x)2 

2! 


f'\x) 


( b  ~  x)n  1  ,(W_D 
(n  —  1)!  7 


(x). 


^n(^)  =  /(&)  -  /(a:)  -  (b  -  x)f'(x) 
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This  function  vanishes  for  x  =  a  and  x  =  b]  its  derivative  is 


n(b  —  x)n  1 
( b  -  a)n 


and  there  must  be  some  value  of  x  between  a  and  b  for  which  the  derivative 
vanishes.  This  leads  at  once  to  the  desired  result. 

In  view  of  the  great  importance  of  this  theorem  we  shall  give  at  the 
end  of  this  chapter  another  proof,  not  essentially  distinct  from  that  given 
above,  but  different  in  form  and  depending  on  the  method  of  integration 
by  parts. 


Examples  LV.  1.  Suppose  that  f(x)  is  a  polynomial  of  degree  r.  Then 
f(n\x)  is  identically  zero  when  n  >  r,  and  the  theorem  leads  to  the  algebraical 
identity 

7,2  i r 

f(a  +  h)  =  f(a)  +  hf'(a)  +  — f"  (a )  +  •  •  •  H - -f^r\a). 

2!  r! 

2.  By  applying  the  theorem  to  f(x)  =  1/x,  and  supposing  x  and  x  +  h 
positive,  obtain  the  result 

1  1  h  h2  (—l)n~1hn~1  (~l)nhn 

- = - -  -\ — - —  •••  +  - - - - 1 - - - - - . 

x  +  h  x  x2  x 3  xn  (x  +  9nh)n+l 

[Since 

1  1  h  h2  (— l)n_1/in_1  (-l)"7in 

-  = - —  -j - •••  +  - - - - 1-  A - ' - 

x  +  h  x  x2  x3  xn  xn(x  +  h)  ’ 

we  can  verify  the  result  by  showing  that  xn(x  +  h)  can  be  put  in  the  form 
(x  +  9nh)n+1,  or  that  xn+1  <  xn(x  +  h)  <  (x  +  h)n+1,  as  is  evidently  the  case.] 

3.  Obtain  the  formula 

.  ,  .  h2  .  3  3! 

sm  x  +  a)  =  sm  x  +  h  cos  x - r  sin  x  —  n  — x  +  . . . 

2!  cos 

;.2n—  1  o  | 

+  (-ir^TR - —t  cos  x  +  (-1  )nh2n—(x  +  92nh), 

(2  n  —  lj!  sm 
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the  corresponding  formula  for  cos(x  +  h),  and  similar  formulae  involving  powers 
of  h  extending  up  to  h2n+l . 

4.  Show  that  if  m  is  a  positive  integer,  and  n  a  positive  integer  not  greater 
than  m,  then 


0 x+h)m  =  xm+  (^Jxm~1h+- •  •+  i  )xm“n+1/in-1+ (^)  (x+enh)m-nhn. 

Show  also  that,  if  the  interval  [x,  x  +  h\  does  not  include  x  =  0,  the  formula 
holds  for  all  real  values  of  m  and  all  positive  integral  values  of  n;  and  that,  even 
iix<0<x  +  hoTX  +  h<0<x,  the  formula  still  holds  if  m  —  n  is  positive. 

5.  The  formula  f{x  +  h)  =  f(x)  +  hf  (x  +  9\h)  is  not  true  if  f(x)  =  1/x  and 
x  <  0  <  x  +  h.  [For  f(x  +  h)  —  f(x)  >  0  and  hf'(x  +  9\h)  =  —h/(x  +  9\h)2  <  0; 
it  is  evident  that  the  conditions  for  the  truth  of  the  Mean  Value  Theorem  are 
not  satisfied.] 

6.  If  x  =  —a,  h  =  2a,  f(x)  =  x1/3,  then  the  equation 

f(x  +  h)  =  f(x)  +  hf(x  +  91  h) 

is  satisfied  by  9\  =  ±  ^  \/3-  [This  example  shows  that  the  result  of  the  theorem 

may  hold  even  if  the  conditions  under  which  it  was  proved  are  not  satisfied.] 

7.  Newton’s  method  of  approximation  to  the  roots  of  equations. 
Let  £  be  an  approximation  to  a  root  of  an  algebraical  equation  f(x)  =  0,  the 
actual  root  being  £  +  h.  Then 

0  =  /(£  +  &)  =  m  +  hf'(0  +  \h2f "(£  +  e2h), 


so  that 

h  =  _m__ih2  ne+M 
m  2  m  ' 

It  follows  that  in  general  a  better  approximation  than  x  =  £  is 


x  =  £ 


m 

m' 


If  the  root  is  a  simple  root,  so  that  f'(£+h)  V  0,  we  can,  when  h  is  small  enough, 
find  a  positive  constant  K  such  that  \  f'{x)\  >  K  for  all  the  values  of  x  which 
we  are  considering,  and  then,  if  h  is  regarded  as  of  the  first  order  of  smallness, 
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/(£)  is  of  the  first  order  of  smallness,  and  the  error  in  taking  £  —  {/(£)/ f  (£,)} 
as  the  root  is  of  the  second  order. 

8.  Apply  this  process  to  the  equation  x2  =  2,  taking  £  =  3/2  as  the  first 
approximation.  [We  find  h  =  — 1/12,  £  +  h  =  17/12  =  1.417 . . . ,  which  is  quite  a 
good  approximation,  in  spite  of  the  roughness  of  the  first.  If  now  we  repeat  the 
process,  taking  £  =  17/12,  we  obtain  £  +  h  =  577/408  =  1.414  215  . . . ,  which  is 
correct  to  5  places  of  decimals.] 

9.  By  considering  in  this  way  the  equation  x2  —  1  —  y  =  0,  where  y  is  small, 
show  that  y/1  +  y  =  1  +  \y  —  {\y2 /{ 2  +  y)}  approximately,  the  error  being  of 
the  fourth  order. 

10.  Show  that  the  error  in  taking  the  root  to  be  £  —  (///')  —  \{f2  f"  /  f'^) , 
where  £  is  the  argument  of  every  function,  is  in  general  of  the  third  order. 

11.  The  equation  sin  x  =  ax,  where  a  is  small,  has  a  root  nearly  equal  to  n. 
Show  that  (1  —  a) n  is  a  better  approximation,  and  (1  —  a  +  a2)7r  a  better  still. 
[The  method  of  Exs.  7-  10  does  not  depend  on  f{x )  =  0  being  an  algebraical 
equation,  so  long  as  f  and  f"  are  continuous.] 

12.  Show  that  the  limit  when  h  — >  0  of  the  number  9n  which  occurs  in  the 
general  Mean  Value  Theorem  is  l/(n+l),  provided  that  /(n+1)(x)  is  continuous. 

[For  f(x  +  h)  is  equal  to  each  of 

un  Ln  i,  n+1 

f(x)-\ - 1 — 7  f^l\x  +  9nh),  /( x)  +  -  ■  -H — -/^(x)  +  y —  f[n+1\x+6n+ih), 

n\  n\  (n  +  1) ! 

where  9n+ 1  as  well  as  9n  lies  between  0  and  1.  Hence 


fW(x  +  9nh) 


f(n)(x)  +  hf{n+1)(x  +  9n+1h) 


But  if  we  apply  the  original  Mean  Value  Theorem  to  the  function  f^n\x),  taking 
9nh  in  place  of  h,  we  find 


f{n)(x  +  9nh)  =  /W(  x)  +  9nhf(n+1\x  +  99nh ), 


where  9  also  lies  between  0  and  1.  Hence 


9nf{n+1)(x  +  99nh) 


/("+1)(x  +  9n+lh) 
n+  1 


from  which  the  result  follows,  since  f(n+1\x  +  99nh)  and  f^n+l\x  +  9njrih)  tend 
to  the  same  limit  f^n+1\x)  as  h  — >  0.] 
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13.  Prove  that  {f{x+2h)  —  2f(x+h)  +  f(x)}/h?  — >  f"(x)  as  h  — >  0,  provided 
that  f"(x)  is  continuous.  [Use  equation  (2)  of  §  147.] 

14.  Show  that,  if  the  fin\x)  is  continuous  for  x  =  0,  then 

fix)  =  ao  +  aix  +  CL2X2  H - h  (an  +  ex)xn, 

where  ar  =  /^(0)/r!  and  ex  0  as  x  — >  0.* 

15.  Show  that  if 

a0  +  a\x  +  a2x2  H - h  (an  +  ex)xn  =  b0  +  b\x  +  b2x2  H - h  (bn  +  r\x)xn , 

where  ex  and  rjx  tend  to  zero  as  x  — >  0,  then  ao  =  bo,  ai  =  b\ ,  . . . ,  an  =  bn. 
[Making  x  — >  0  we  see  that  ao  =  &o-  Now  divide  by  x  and  afterwards  make 
x  — >  0.  We  thus  obtain  ai  =  b\ ;  and  this  process  may  be  repeated  as  often  as  is 
necessary.  It  follows  that  if  f(x)  =  ao  +  a\x  +  a2x2  +  ■  •  •  +  (an  +  ex)xn,  and  the 
first  n  derivatives  of  f(x)  are  continuous,  then  ar  =  fir\0)/r\.] 


148.  Taylor’s  Series.  Suppose  that  f{x)  is  a  function  all  of  whose 
differential  coefficients  are  continuous  in  an  interval  [ a—rj ,  a+ij]  surrounding 
the  point  x  =  a.  Then,  if  h  is  numerically  less  than  r),  we  have 

hn~ i  hn 

f(a  +  h )  =  f{a)  +  hf\a)  +  •  •  •  +  - urr/1'”  ^(a)  4 — vf^n\a  +  &nh), 

[n  —  lj!  n\ 

where  0  <  9n  <  1,  for  all  values  of  n.  Or,  if 

^  ^  7  77, 

S"  =  EVH(“)>  R.  =  V(n)(o  +  9„fc). 

■ '  i/!  n! 


we  have 

/(a  +  h)  -Sn  =  Rn. 

*It  is  in  fact  sufficient  to  suppose  that  /^(O)  exists.  See  R.  H.  Fowler,  “The 
elementary  differential  geometry  of  plane  curves”  ( Cambridge  Tracts  in  Mathematics , 
No.  20,  p.  104). 
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Now  let  us  suppose,  in  addition,  that  we  can  prove  that  Rn  A  0  as 
n  — >  oo.  Then 

f(a  +  h)  =  lim  Sn  =  f(a )  +  hf'(a)  +  f"(a )  +  .... 

7i— >■  OO  z! 

This  expansion  of  /(a  +  h)  is  known  as  Taylor’s  Series.  When  a  =  0 
the  formula  reduces  to 

f(h)  =f(  0)  +  hf(  0)  +  ^j-/"(0)  +  . . . , 

which  is  known  as  Maclaurin’s  Series.  The  function  Rn  is  known  as 
Lagrange’s  form  of  the  remainder. 

The  reader  should  be  careful  to  guard  himself  against  supposing  that  the 
continuity  of  all  the  derivatives  of  /(x)  is  a  sufficient  condition  for  the  validity 
of  Taylor’s  series.  A  direct  discussion  of  the  behaviour  of  Rn  is  always  essential. 


Examples  LVI.  1.  Let  /(x)  =  sinx.  Then  all  the  derivatives  of  f(x) 
are  continuous  for  all  values  of  x.  Also  |/n(x)|  ^  1  for  all  values  of  x  and  n. 
Hence  in  this  case  |f?n|  ^  hn/n\,  which  tends  to  zero  as  n  A  oo  (Ex.  xxvn.  12) 
whatever  value  h  may  have.  It  follows  that 

....  h?  .  h 3  h4  . 

sm  x  +  h)  =  smx  +  n  cosx - r  smx - r  cosx  +  —  smx  +  . . . , 

2!  3!  4! 

for  all  values  of  x  and  h.  In  particular 

h 3  h 5 

amh  =  h-  —  +  - -..., 

for  all  values  of  h.  Similarly  we  can  prove  that 

,  ,  h?  h3  .  h2  h4 

cos  (x  +  h)  =  cos  x  —  h  smx - r  cosxH — r  sinx  + . . . ,  cos  h  =  1 - r  +  wr  —  •••• 

2!  3!  2!  4! 


2.  The  Binomial  Series.  Let  /(x)  =  (1  +  x)m,  where  m  is  any  rational 
number,  positive  or  negative.  Then  j(n)(x)  =  m(m  —  1) . . .  (m  —  n  +  l)(l  +  x)m_n 
and  Maclaurin’s  Series  takes  the  form 


x  +  .... 


(l  +  x)m  =  1  + 
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When  m  is  a  positive  integer  the  series  terminates,  and  we  obtain  the  ordi¬ 
nary  formula  for  the  Binomial  Theorem  with  a  positive  integral  exponent.  In 
the  general  case 


Rn  =  ~^f{n\enx)  = 
n\ 


m 


n 


Xn(l  +  6nx)r 


and  in  order  to  show  that  Maclaurin’s  Series  really  represents  (1  +  x)m  for  any 
range  of  values  of  x  when  m  is  not  a  positive  integer,  we  must  show  that  R„  — »  0 
for  every  value  of  x  in  that  range.  This  is  so  in  fact  if  —1  <  x  <  1,  and  may  be 
proved,  when  0  ^  x  <  1,  by  means  of  the  expression  given  above  for  Rn,  since 

/  777,  \ 

(1  +  9nx)m~n  <  1  if  n  >  m,  and  )xn  — >  0  as  n  — >•  oo  (Ex.  xxvil.  13).  But  a 


\n  j 

difficulty  arises  if  —  1  <  x  <  0,  since  1  +  9nx  <  1  and  (1  +  9nx)m~n  >  1  if  n  >  m; 
knowing  only  that  0  <  9n  <  1,  we  cannot  be  assured  that  1  +  9nx  is  not  quite 
small  and  (1  +  9nx)m~n  quite  large. 

In  fact,  in  order  to  prove  the  Binomial  Theorem  by  means  of  Taylor’s  The¬ 
orem,  we  need  some  different  form  for  Rn,  such  as  will  be  given  later  (§  162). 


149.  Applications  of  Taylor’s  Theorem.  A.  Maxima  and 
minima.  Taylor’s  Theorem  may  be  applied  to  give  greater  theoretical 
completeness  to  the  tests  of  Ch.  VI,  §§  122-123,  though  the  results  are  not 
of  much  practical  importance.  It  will  be  remembered  that,  assuming  that 
(j>(x)  has  derivatives  of  the  first  two  orders,  we  stated  the  following  as  being 
sufficient  conditions  for  a  maximum  or  minimum  of  0(x)  at  x  =  £:  for  a 
maximum ,  (f>'(£)  =  0,  (f>"(£)  <  0;  for  a  minimum ,  (f>'(£)  =  0,  (f>"(£)  >  0.  It 
is  evident  that  these  tests  fail  if  (f>"(£)  as  well  as  0;(O  is  zero. 

Let  us  suppose  that  the  first  n  derivatives 

are  continuous,  and  that  all  save  the  last  vanish  when  x  —  £.  Then,  for 
sufficiently  small  values  of  h, 

jn 

n\ 
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In  order  that  there  should  be  a  maximum  or  a  minimum  this  expression 
must  be  of  constant  sign  for  all  sufficiently  small  values  of  h,  positive  or 
negative.  This  evidently  requires  that  n  should  be  even.  And  if  n  is  even 
there  will  be  a  maximum  or  a  minimum  according  as  0 ^  (£)  is  negative  or 
positive. 

Thus  we  obtain  the  test:  if  there  is  to  be  a  maximum  or  minimum  the 
first  derivative  which  does  not  vanish  must  be  an  even  derivative,  and  there 
will  be  a  maximum  if  it  is  negative,  a  minimum  if  it  is  positive. 


Examples  LVII.  1.  Verify  the  result  when  0(x)  =  (x  —  a)m,  m  being 
a  positive  integer,  and  £  =  a. 

2.  Test  the  function  ( x  —  a)m{x  —  b)n,  where  rn  and  n  are  positive  integers, 
for  maxima  and  minima  at  the  points  x  =  a,  x  =  b.  Draw  graphs  of  the  different 
possible  forms  of  the  curve  y  =  (x  —  a)m(x  —  b)n. 


X ^  X ^ 

3.  Test  the  functions  sinx  —  x,  sin  x  —  x  H - ,  sin  a™  —  x  H - ,  . . . , 

’  6  ’  6  120’  ’ 
2  2  4 

.  X  X  X 

cos  x  —  1,  cos  x  —  1  +  — ,  cos  x  —  1  +  — —  — ,  ...  ror  maxima  or  minima  at  x  =  0. 


150.  B.  The  calculation  of  certain  limits.  Suppose  that  f(x) 
and  4>(x)  are  two  functions  of  x  whose  derivatives  fix)  and  4>'(x)  are 
continuous  for  x  =  f  and  that  /(£)  and  0(£)  are  both  equal  to  zero.  Then 
the  function 

'lf(x)  =  f(x)/(j)(x) 

is  not  defined  when  x  =  f.  But  of  course  it  may  well  tend  to  a  limit  as 
x->£. 

Now 

f(x)  =  f(x)  -  /(£)  =  (x  - i)f(x i), 

where  X\  lies  between  £  and  x;  and  similarly  <f>(x)  —  (x  —  f)4>'{x 2),  where 
X2  also  lies  between  £  and  x.  Thus 

0(x)  = 


We  must  now  distinguish  four  cases. 
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(1)  If  neither  /'(£)  nor  0'(O  is  zero,  then 

m/<Kx)  mmt)- 

(2)  If  /'  (0  =  0,  0'(O  0  0,  then 

f(x)/(f)(x)  -A  0. 

(3)  If  /'(£)  0  0,  0'(O  =  0,  then  f{x)/(j>(x)  becomes  numerically  very 
large  as  x  — »  0  but  whether  f(x)/<f>(x)  tends  to  oo  or  —  oo,  or  is  some¬ 
times  large  and  positive  and  sometimes  large  and  negative,  we  cannot  say, 
without  further  information  as  to  the  way  in  which  <f'(x)  — >  0  as  x  -A  0 

(4)  If  /'(£)  =  0,  0'(0  =  0)  then  we  can  as  yet  say  nothing  about  the 
behaviour  of  f(x)/(f>(x)  as  x  — >  0. 

But  in  either  of  the  last  two  cases  it  may  happen  that  f(x)  and  <fi(x) 
have  continuous  second  derivatives.  And  then 

nx)  =  nx)  -  /(o  -  (x  -  om  =  i(x-  o2r(x  i), 

</>(%)  =  4>{x)  -  0(0  -{x-  00(0  =  \{x-  0  V'OuO 

where  again  X\  and  X2  lie  between  £  and  x;  so  that 

0(0  =  f(x  1W'{X2). 

We  can  now  distinguish  a  variety  of  cases  similar  to  those  considered  above. 
In  particular,  if  neither  second  derivative  vanishes  for  x  =  0  we  have 

/(OMO  r(o/0"(o- 

It  is  obvious  that  this  argument  can  be  repeated  indefinitely,  and  we 
obtain  the  following  theorem:  suppose  that  f{x )  and  <f>(x)  and  their  deriva¬ 
tives,  so  far  as  may  be  wanted,  are  continuous  for  x  —  0  Suppose  further 
that  f(p\x)  and  <f^q\x)  are  the  first  derivatives  of  f(x)  and  <fi(x)  which  do 
not  vanish  when  x  —  0  Then 

(1)  ifp  =  q,  f{x)/<j>{x)  -A  /(p)( 0/Op)(0; 

(2)  ifp>q,f(x)/(/>(x)->0; 
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(3)  if  p  <  q,  and  q  —  p  is  even,  either  f(x)/<j)(x)  -A  +oo  or 
f{x)/4>{x)  — »  —  oo,  the  sign  being  the  same  as  that  of  /^(£)/0^(£)> 

(4)  if  p  <  q  and  q—p  is  odd,  either  f(x)/4>(x)  -A  +cx)  or 
f(x)/4>{x)  -A  —  oo;  as  a;  — *  £  +  07  the  sign  being  the  same  as  that  of 
/^(£)/0^(£),  while  if  x  — »  £  —  0  £/ie  sign  must  be  reversed. 

This  theorem  is  in  fact  an  immediate  corollary  from  the  equations 

f(x)  =  (X  ^  f(p)(x i),  0(x)  =  —  ^  0(g)(^2)- 

p!  g! 


Examples  LVIII.  1.  Find  the  limit  of 

{x  —  (n  +  l)xn+1  +  nxn+2} /(I  —  x)2, 

as  x  -A  1.  [Here  the  functions  and  their  first  derivatives  vanish  for  x  =  1,  and 
/"(l)  =  n(n  +  l),  f"(l)  =  2.} 

2.  Find  the  limits  as  x  -A  0  of 

(tanx  —  x)/(x  —  sinx),  (tan  nx  —  n  tan  x) /(n  sin  x  —  sinnx). 


3.  Find  the  limit  of  x{\/x2  +  a2  —  x}  as  x  — >  oo.  [Put  x  =  1/y.] 

4.  Prove  that 


lim  (x  —  n)  cosec  X7r  = 


(-D 


n  1  f  (-1)? 

lim - <  cosec  X7r  — 


7 r 


x  —  n 


{x  —  Tl)  7T 


(— l)nvr 
6  ’ 


n  being  any  integer;  and  evaluate  the  corresponding  limits  involving  cotx7T. 

5.  Find  the  limits  as  x  — >  0  of 


1 

(  1 

x\ 

1 

/  +  1 

x\ 

-  1 

cosec  x - 

-  -  , 

— 

cot  x - 

+  ~  ) 

X3 

V  X 

6/ 

X3 

V  X 

3/ 

6.  (sinx  arc  sinx  —  x2)/x6  -A  yg,  (tanx  arc  tanx  —  x2)/x6  -A  as  x  — >  0. 
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151.  C.  The  contact  of  plane  curves.  Two  curves  are  said  to 
intersect  (or  cut)  at  a  point  if  the  point  lies  on  each  of  them.  They  are 
said  to  touch  at  the  point  if  they  have  the  same  tangent  at  the  point. 

Let  us  suppose  now  that  f(x),  (j)(x)  are  two  functions  which  possess 
derivatives  of  all  orders  continuous  for  x  =  f,  and  let  us  consider  the 
curves  y  =  f(x),  y  =  4>(x).  In  general  /(f)  and  /(f)  will  not  be  equal.  In 
this  case  the  abscissa  x  =  f  does  not  correspond  to  a  point  of  intersection 
of  the  curves.  If  however  /(f)  =  /(f),  the  curves  intersect  in  the  point 
x  —  f,  y  —  /(f)  =  /(f).  Let  us  suppose  this  to  be  the  case.  Then  in  order 
that  the  curves  should  not  only  cut  but  touch  at  this  point  it  is  obviously 
necessary  and  sufficient  that  the  first  derivatives  fix),  (f>'{x)  should  also 
have  the  same  value  when  x  —  f . 

The  contact  of  the  curves  in  this  case  may  be  regarded  from  a  different 
point  of  view.  In  the  figure  the  two  curves  are  drawn  touching  at  P,  and 


QR  is  equal  to  /(f  +  h)  -  f(£  +  h ),  or,  since  0(f)  =  /(f),  /'(f) 

|h2{/'/(f+^)-r(f+^)}, 

where  6  lies  between  0  and  1.  Hence 


1™  =  IU"«)  - /"«)}, 


/'(f),  to 


[VII :  151]  ADDITIONAL  THEOREMS  IN  THE  CALCULUS 


331 


when  h  — >  0.  In  other  words,  when  the  curves  touch  at  the  point  whose 
abscissa  is  £,  the  difference  of  their  ordinates  at  the  point  whose  abscissa 
is  f  +  h  is  at  least  of  the  second  order  of  smallness  when  h  is  small. 

The  reader  will  easily  verify  that  lim  (QR/h)  =  <p'  (f)  —  f'iff)  when  the  curves 
cut  and  do  not  touch,  so  that  QR  is  then  of  the  first  order  of  smallness  only. 

It  is  evident  that  the  degree  of  smallness  of  QR  may  be  taken  as  a 
kind  of  measure  of  the  closeness  of  the  contact  of  the  curves.  It  is  at  once 
suggested  that  if  the  first  n  —  1  derivatives  of  /  and  <f>  have  equal  values 
when  x  =  £,  then  QR  will  be  of  nth  order  of  smallness;  and  the  reader  will 
have  no  difficulty  in  proving  that  this  is  so  and  that 

iim|^  =  t[fU(”>(e)-/<")K)}. 

We  are  therefore  led  to  frame  the  following  definition: 

Contact  of  the  nth  order.  If  /(£)  =  </>(£),  /'(£)  =  ■■■, 

/(”)(£)  =  </>(”)(£),  but  f{n+1)(f)  7^  0(n+1)(£),  then  the  curves  y  =  f(x), 
y  =  4>(x)  will  be  said  to  have  contact  of  the  nth  order  at  the  point  whose 
abscissa  is  £. 

The  preceding  discussion  makes  the  notion  of  contact  of  the  nth  order 
dependent  on  the  choice  of  axes,  and  fails  entirely  when  the  tangent  to  the 
curves  is  parallel  to  the  axis  of  y.  We  can  deal  with  this  case  by  taking  y  as 
the  independent  and  x  as  the  dependent  variable.  It  is  better,  however,  to 
consider  x  and  y  as  functions  of  a  parameter  t.  An  excellent  account  of 
the  theory  will  be  found  in  Mr  Fowler’s  tract  referred  to  on  p.  324,  or  in 
de  la  Vallee  Poussin’s  Cours  d' Analyse,  vol.  ii,  pp.  396  et  seq. 

Examples  LIX.  1.  Let  4>(x)  =  ax  +  b,  so  that  y  =  <j>{x)  is  a  straight 
line.  The  conditions  for  contact  at  the  point  for  which  x  =  £  are  /(£)  =  a£  +  b, 
f(0  =  a-  If  we  determine  a  and  b  so  as  to  satisfy  these  equations  we  find 
a  =  /'(£),  b  =  /(£)  —  and  the  equation  of  the  tangent  to  y  =  f(x)  at  the 

point  x  =  £  is 

y  =  xf'{Q  +  {f{Q-U'm, 

or  y  -  /(£)  =  (x  —  £)/'(£).  Cf.  Ex.  xxxix.  5. 
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2.  The  fact  that  the  line  is  to  have  simple  contact  with  the  curve  completely 
determines  the  line.  In  order  that  the  tangent  should  have  contact  of  the  second 
order  with  the  curve  we  must  have  /"(£)  =  $'(£,),  i.e.  f"(f)  =  0.  A  point  at 
which  the  tangent  to  a  curve  has  contact  of  the  second  order  is  called  a  point 
of  inflexion. 

3.  Find  the  points  of  inflexion  on  the  graphs  of  the  functions  3x4  —  6x3  +  1, 
2x/(l  +  x2),  sinx,  a  cos2  x  +  b  sin2  x,  tan  x,  arc  tan  x. 

4.  Show  that  the  conic  ax2  +  2 hxy  +  by2  +  2 gx  +  2  fy  +  c  =  0  cannot  have 
a  point  of  inflexion.  [Here  ax  +  hy  +  g  +  (hx  +  by  +  f)yi  =  0  and 

a  +  2hyi  +  by2  +  (hx  +  by  +  f)y2  =  0, 
suffixes  denoting  differentiations.  Thus  at  a  point  of  inflexion 

a  +  2hyi  +  byj  =  0, 


or 


a(hx  +  by  +  /)2  —  2  h(ax  +  hy  +  g)(hx  +  by  +  f)  +  b(ax  +  hy  +  g )2  =  0, 


or 


( ab  —  h2){ax 2  +  2  hxy  +  by 2  +  2  gx  +  2  fy}  +  af2  —  2  fgh  +  bg2  =  0. 

But  this  is  inconsistent  with  the  equation  of  the  conic  unless 

af 2  —  2  fgh  +  bg2  =  c(ab  —  h2) 

or  abc  +  2 fgh  —  af 2  —  bg2  —  ch 2  =  0;  and  this  is  the  condition  that  the  conic 
should  degenerate  into  two  straight  lines.] 

5.  The  curve  y  =  (ax2  +  2 bx  +  c)/(ax2  +  2f3x  +  7)  has  one  or  three  points 
of  inflexion  according  as  the  roots  of  ax2  +  2/3x  +  7  =  0  are  real  or  complex. 

[The  equation  of  the  curve  can,  by  a  change  of  origin  (cf.  Ex.  xlvi.  15),  be 
reduced  to  the  form 

V  =  iKM2  +  2 Bi  +  c)  =  £/{A(£  -  m  -  q)}, 

where  p,  q  are  real  or  conjugate.  The  condition  for  a  point  of  inflexion  will 
be  found  to  be  £3  —  3 pqf  +  pq(p  +  q)  =  0,  which  has  one  or  three  real  roots 
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according  as  {pq(p  —  c/)}2  is  positive  or  negative,  i.e.  according  as  p  and  q  are 
real  or  conjugate.] 

6.  Discuss  in  particular  the  curves  y  =  (l—x)/(l +x2),y  =  (1— x2)/(l+x2), 
y  =  (1  +  x2)/(l  -  x 2). 

7.  Show  that  when  the  curve  of  Ex.  5  has  three  points  of  inflexion,  they  lie 
on  a  straight  line.  [The  equation  £3  —  3 pq£  +  pq(p  +  q)  =  0  can  be  put  in  the 
form  (£  —  p)(£  —  q)(£  +  p  +  q)  +  (p  —  q )2£  =  0,  so  that  the  points  of  inflexion  lie 
on  the  line  £  +  A(p  —  q)2rq  +  p  +  g  =  0  or  -  4 {AC  —  B2)rj  =  2 B.) 

8.  Show  that  the  curves  y  =  x  sin  x,  y  =  (sin  x)/x  have  each  infinitely  many 
points  of  inflexion. 

9.  Contact  of  a  circle  with  a  curve.  Curvature.*  The  general  equation 
of  a  circle,  viz. 

(x  -  a)2  +  (y  -  b)2  =  r2,  (1) 

contains  three  arbitrary  constants.  Let  us  attempt  to  determine  them  so  that 
the  circle  has  contact  of  as  high  an  order  as  possible  with  the  curve  y  =  f(x)  at 
the  point  (£,  77),  where  77  =  /(£).  We  write  771,  772  for  //(^),  /"{£,)■  Differentiating 
the  equation  of  the  circle  twice  we  obtain 


{x  -  a)  +  {y  -  6)7/1  =  0,  (2) 

1  +  y\  +  (y  ~  b)y 2  =  0.  (3) 


If  the  circle  touches  the  curve  then  the  equations  (1)  and  (2)  are  satisfied 
when  x  =  £,  y  =  77,  7/1  =  771.  This  gives  (£  —  a)/r]i  =  —(77  —  6)  =  r/ \J\  +  rj2.  If 
the  contact  is  of  the  second  order  then  the  equation  (3)  must  also  be  satisfied 
when  7/2  =  772.  Thus  b  =  77  +  {(1  +  77^ ) / 772 } ;  and  hence  we  find 

771(1  +  ??i)  u  l  +  >)i  (1  +  ?7i)3/2 

a  =  £ - ,  6  =  77  H - ,  r  = - . 

m  m  V2 

The  circle  which  has  contact  of  the  second  order  with  the  curve  at  the  point 
(£,  77)  is  called  the  circle  of  curvature,  and  its  radius  the  radius  of  curvature. 
The  measure  of  curvature  (or  simply  the  curvature )  is  the  reciprocal  of  the 
radius:  thus  the  measure  of  curvature  is  ///(£)/{  1  +  [f{C)]2}3^2^  or 


d2i  7 

d e2 


/ 


1  + 


2n  3/2 


*A  much  fuller  discussion  of  the  theory  of  curvature  will  be  found  in  Mr  Fowler’s 
tract  referred  to  on  p.  324. 
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10.  Verify  that  the  curvature  of  a  circle  is  constant  and  equal  to  the  reciprocal 
of  the  radius;  and  show  that  the  circle  is  the  only  curve  whose  curvature  is 
constant. 

11.  Find  the  centre  and  radius  of  curvature  at  any  point  of  the  conics 
y 2  =  4 ax,  ( x/a )2  +  ( y/b )2  =  1. 

12.  In  an  ellipse  the  radius  of  curvature  at  P  is  CD3/ab ,  where  CD  is  the 
semi-diameter  conjugate  to  CP. 

13.  Show  that  in  general  a  conic  can  be  drawn  to  have  contact  of  the  fourth 
order  with  the  curve  y  =  f(x)  at  a  given  point  P. 

[Take  the  general  equation  of  a  conic,  viz. 

ax 2  +  2  hxy  +  by 2  +  2  gx  +  2  fy  +  c  =  0, 

and  differentiate  four  times  with  respect  to  x.  Using  suffixes  to  denote  differen¬ 
tiation  we  obtain 


ax  +  hy  +  g  +  (hx  +  by  +  f)yi  =  0, 

a  +  2hyi  +  byf  +  (hx  +  by  +  f)y2  =  0, 

3  (h  +  byi)y2  +  (hx  +  by  +  f)y3  =  0, 

4  (h  +  byi)y3  +  3  byj  +  (hx  +  by  +  f)y4  =  0. 

If  the  conic  has  contact  of  the  fourth  order,  then  these  five  equations  must  be 

satisfied  by  writing  £,  77,  771,  rj2,  m,  V4 ■  for  x,  y,  yi,  y2,  y3,  y4.  We  have  thus  just 

enough  equations  to  determine  the  ratios  a  :  b  :  c  :  f  :  g  :  h.) 

14.  An  infinity  of  conics  can  be  drawn  having  contact  of  the  third  order  with 
the  curve  at  P.  Show  that  their  centres  all  lie  on  a  straight  line. 

[Take  the  tangent  and  normal  as  axes.  Then  the  equation  of  the  conic  is  of 
the  form  2 y  =  ax 2  +  2 hxy  +  by2,  and  when  x  is  small  one  value  of  y  may  be 
expressed  (Ch.  V,  Misc.  Ex.  22)  in  the  form 

y  =  \ax2  +  (\ah  +  ex )  x3, 

where  ex  — >  0  with  x.  But  this  expression  must  be  the  same  as 

y  =  o)®2  +  {g/W(0)  + 

where  — >  0  with  x,  and  so  a  =  f"( 0),  h  =  /"'(0)/3/"(0),  in  virtue  of  the 

result  of  Ex.  LV.  15.  But  the  centre  lies  on  the  line  ax  +  hy  =  0.] 
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15.  Determine  a  parabola  which  has  contact  of  the  third  order  with  the 
ellipse  ( x/a )2  +  ( y/b )2  =  1  at  the  extremity  of  the  major  axis. 

16.  The  locus  of  the  centres  of  conics  which  have  contact  of  the  third  order 
with  the  ellipse  (x/a)2  +  (y/b)2  =  1  at  the  point  (a  cos  a,  b  sin  a)  is  the  diameter 
x/(acosa)  =  y/(b  sin  a).  [For  the  ellipse  itself  is  one  such  conic.] 

152.  Differentiation  of  functions  of  several  variables.  So  far 

we  have  been  concerned  exclusively  with  functions  of  a  single  variable  x, 
but  there  is  nothing  to  prevent  us  applying  the  notion  of  differentiation  to 
functions  of  several  variables  x,  y,  .... 

Suppose  then  that  f(x,y)  is  a  function  of  two*  real  variables  x  and  y, 
and  that  the  limits 

lim  f(x  +  h,y)  -  f(x,y)  ^  f(x,y  +  k)  -  f(x,y) 
h — >0  lx  k — ^0 

exist  for  all  values  of  x  and  y  in  question,  that  is  to  say  that  f(x,  y)  possesses 
a  derivative  df/dx  or  Dxf(x,y )  with  respect  to  x  and  a  derivative  df/dy 
or  Dyf(x,y)  with  respect  to  y.  It  is  usual  to  call  these  derivatives  the 
partial  differential  coefficients  of  /,  and  to  denote  them  by 

dj_  d£ 

dx ’  dy 
or 

fx(x,y),  fy(x,y) 

or  simply  ff  f  or  fx,  fy .  The  reader  must  not  suppose,  however,  that  these 
new  notations  imply  any  essential  novelty  of  idea:  ‘partial  differentiation’ 
with  respect  to  x  is  exactly  the  same  process  as  ordinary  differentiation,  the 
only  novelty  lying  in  the  presence  in  /  of  a  second  variable  y  independent 
of  x. 

In  what  precedes  we  have  supposed  x  and  y  to  be  two  real  variables 
entirely  independent  of  one  another.  If  x  and  y  were  connected  by  a  relation 

*The  new  points  which  arise  when  we  consider  functions  of  several  variables  are 
illustrated  sufficiently  when  there  are  two  variables  only.  The  generalisations  of  our 
theorems  for  three  or  more  variables  are  in  general  of  an  obvious  character. 
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the  state  of  affairs  would  be  very  different.  In  this  case  our  definition  of  f'x 
would  fail  entirely,  as  we  could  not  change  x  into  x  +  h  without  at  the  same 
time  changing  y.  But  then  /(x,  y )  would  not  really  be  a  function  of  two 
variables  at  all.  A  function  of  two  variables,  as  we  defined  it  in  Ch.  II,  is 
essentially  a  function  of  two  independent  variables.  If  y  depends  on  x,  y  is 
a  function  of  x,  say  y  =  0(x);  and  then 

f(x,y)  =  f{x,<j>(x)} 

is  really  a  function  of  the  single  variable  x.  Of  course  we  may  also  represent 
it  as  a  function  of  the  single  variable  y.  Or,  as  is  often  most  convenient, 
we  may  regard  x  and  y  as  functions  of  a  third  variable  t ,  and  then  /(x,  y), 
which  is  of  the  form  /{0(f), -0(f)},  is  a  function  of  the  single  variable  f. 


Examples  LX.  1.  Prove  that  if  x  =  rcos#,  y  =  rsin#,  so  that 
r  =  \J x1  +  y 2,  9  =  arc  tan (y/x),  then 


dr 

X 

dr 

y 

d9 

y 

d9 

X 

dx 

^Jx2  +  y2' 

dy 

\A2  +  y2  ’ 

dx 

x2  +  y2  ’ 

dy 

x2  +  y2 

dx 

=  cos  9, 

dy 

=  sin#, 

dx 

— r  sin#, 

dy 

=  r  cos  #. 

dr 

dr 

d9 

d9 

2.  Account  for  the  fact  that  +  1  /  ( and  /  1  /  ( Y  [When 

ox  /  \or  J  ox  /  \o9  J 

we  were  considering  a  function  y  of  one  variable  x  it  followed  from  the  definitions 
that  dy/dx  and  dx/dy  were  reciprocals.  This  is  no  longer  the  case  when  we  are 
dealing  with  functions  of  two  variables.  Let  P  (Fig.  46)  be  the  point  (x,  y)  or 
(r,  9).  To  find  dr/dx  we  must  increase  x,  say  by  an  increment  MM\  =  5x,  while 
keeping  y  constant.  This  brings  P  to  Pi.  If  along  OP\  we  take  OP'  =  OP,  the 
increment  of  r  is  P' P\  =  5r ,  say;  and  dr/dx  =  lim(5r/(fx).  If  on  the  other  hand 
we  want  to  calculate  dx/dr ,  x  and  y  being  now  regarded  as  functions  of  r  and  9, 
we  must  increase  r  by  Ar,  say,  keeping  9  constant.  This  brings  P  to  P2,  where 
PP2  =  Ar:  the  corresponding  increment  of  x  is  MM\  =  Ax,  say;  and 


dx/dr  =  lim(Ax/Ar). 
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Now  Ax  =  8x:*  but  A r  5r.  Indeed  it  is  easy  to  see  from  the  figure  that 

\im{8r  /  8x)  =  Pi  /  P  Pi)  =  cos  #, 


but 

lim(Ar/Ax)  =  \\m.(P  P2  /  P  Pi)  =  sec# , 

so  that 

liinfAr/Ar)  =  cos2#. 

The  fact  is  of  course  that  dx/dr  and  dr/dx  are  not  formed  upon  the  same 
hypothesis  as  to  the  variation  of  P.\ 

3.  Prove  that  if  z  =  f(ax  +  by)  then  b(dz/dx)  =  a(dz/dy). 

4.  Find  dX/dx,  dX/dy,  . . .  when  X  +  Y  =  x,  Y  =  xy.  Express  x,  y  as 
functions  of  X,  Y  and  find  dx/dX,  dx/dY,  .... 

5.  Find  8X/dx,  . . .  when  X  +  Y  +  Z  =  x,  Y  +  Z  =  xy,  Z  =  xyz\  express 
x,  y,  z  in  terms  of  X,  Y,  Z  and  find  dx/dX,  .... 

[There  is  of  course  no  difficulty  in  extending  the  ideas  of  the  last  section 
to  functions  of  any  number  of  variables.  But  the  reader  must  be  careful  to 

*Of  course  the  fact  that  Ax  =  Sx  is  due  merely  to  the  particular  value  of  Ar  that  we 
have  chosen  (viz.  PP?).  Any  other  choice  would  give  us  values  of  Ax,  A?’  proportional 
to  those  used  here. 
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impress  on  his  mind  that  the  notion  of  the  partial  derivative  of  a  function  of 
several  variables  is  only  determinate  when  all  the  independent  variables  are 
specified.  Thus  if  u  =  x  +  y  +  z,  x,  y,  and  2  being  the  independent  variables, 
then  du/dx  =  1.  But  if  we  regard  u  as  a  function  of  the  variables  x,  x  +  y  =  77, 
and  x  +  y  +  z  =  C,  so  that  u  =  (,  then  du/dx  =  0.] 


153.  Differentiation  of  a  function  of  two  functions.  There  is  a 
theorem  concerning  the  differentiation  of  a  function  of  one  variable,  known 
generally  as  the  Theorem  of  the  Total  Differential  Coefficient,  which 
is  of  very  great  importance  and  depends  on  the  notions  explained  in  the 
preceding  section  regarding  functions  of  two  variables.  This  theorem  gives 
us  a  rule  for  differentiating 


/{0(t),V^)}, 


with  respect  to  t. 

Let  us  suppose,  in  the  first  instance,  that  f(x,y )  is  a  function  of  the 
two  variables  x  and  y,  and  that  f'x,  fy  are  continuous  functions  of  both 
variables  (§  107)  for  all  of  their  values  which  come  in  question.  And  now 
let  us  suppose  that  the  variation  of  x  and  y  is  restricted  in  that  (x,  y )  lies 
on  a  curve 

x  =  0(t),  y  =  $(t), 

where  (j)  and  -0  are  functions  of  t  with  continuous  differential  coefficients 
Then  f(x,y )  reduces  to  a  function  of  the  single  variable  t, 
say  F(t).  The  problem  is  to  determine  F'(t). 

Suppose  that,  when  t  changes  to  t  +  r,  x  and  y  change  to  x  +  £  and 
y  +  rj.  Then  by  definition 


dF(t) 

dt 


r— >0  T 

lim  ~{f(x  +  £,y  +  rf)  -  f(x,y)} 

T 


lim 


f(x  +  £,y  +  v)  -  f(x,y  +  T])  £  f(x,y  +  7])  -  f(x,y )  y 


T 


V 


T 
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But,  by  the  Mean  Value  Theorem, 

{f(x  +  £,y  +  y)  -  f(x,y  +  v)}/£  =  fx(x  +  0ty  +  v), 

{f(x,y  +  T})  -  f(x,y)}/rj  =  f'y{x,  y  +  ffrj), 

where  6  and  O'  each  he  between  0  and  1.  Asr— 0,  £  — >■  0  and  77  — >■  0,  and 
£/r  — »  (f>'(t),  t]/t  — >  also 

f'x(x  +  9Z>y  +  v)  ->  fx(x>y)%  lyfay  +  Vv)  ->  fy(x,y)- 


Hence 


F\t)  =  Dtf{(/>(t),il>(t)}  =  f'x{x,y)(j)'{t)  +  fy(x,y)ij>'(t), 

where  we  are  to  put  x  =  <f>(t),  y  =  i/;(t)  after  carrying  out  the  differenti¬ 
ations  with  respect  to  x  and  y.  This  result  may  also  be  expressed  in  the 
form 

df  df  dx  df  dy 
dt  dx  dt  dy  dt 

Examples  LXI.  1.  Suppose  <f>(t)  =  (1  —  f2)/(l  +  t2),  ip(t)  =  2t/(l  +  t2), 
so  that  the  locus  of  (x,y)  is  the  circle  x2  +  y1  =  1.  Then 

=  —At/ (1  +  t2)2,  ip'(t)  =  2(1  -  t2)/(l  +  t2)2, 

F\t )  =  {—At/ (1  +  t2)2}/'  +  {2(1  -  t2)/(l  +  t2)2}/;, 

where  x  and  y  are  to  be  put  equal  to  (1  —  t2)/(l  +  t2)  and  2t/(l  +  t2)  after 
carrying  out  the  differentiations. 

We  can  easily  verify  this  formula  in  particular  cases.  Suppose,  e.g.,  that 
f(x,y)  =  x2  +  y2-  Then  f'x  =  2x,  f  =  2 y,  and  it  is  easily  verified  that 
F'(t)  =  2 xcj)' (t)  +  2 yip'(t)  =  0,  which  is  obviously  correct,  since  F(t)  =  1. 

2.  Verify  the  theorem  in  the  same  way  when  (a)  x  =  tm ,  y  =  1  —  tm, 
f(x,  y)  =  x  +  y;  (6)  x  =  a  cos  t,  y  =  asint,  f(x,  y)  =  x2  +  y2. 

3.  One  of  the  most  important  cases  is  that  in  which  t  is  x  itself.  We  then 
obtain 


Dxf{x,  'd(x)}  =  Dxf(x,  y)  +  Dyf(x ,  y)ip'(x). 
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where  y  is  to  be  replaced  by  ip(x)  after  differentiation. 

It  was  this  case  which  led  to  the  introduction  of  the  notation  df  /d x,  df  /dy. 
For  it  would  seem  natural  to  use  the  notation  df  /dx  for  either  of  the  functions 
Dxf{x,if(x)}  and  Dxf(x,y),  in  one  of  which  y  is  put  equal  to  if(x)  before  and 
in  the  other  after  differentiation.  Suppose  for  example  that  y  =  1  —  x  and 
f{x,  y)  =  x  +  y.  Then  Dxf(x,  1  -  x)  =  Dx  1  =  0,  but  Dxf(x,  y)  =  1. 

The  distinction  between  the  two  functions  is  adequately  shown  by  denoting 
the  first  by  df  /dx  and  the  second  by  df  /dx,  in  which  case  the  theorem  takes 
the  form 


df_  =  df_  +  df_  dy 

dx  dx  dy  dx  ’ 


though  this  notation  is  also  open  to  objection,  in  that  it  is  a  little  misleading  to 
denote  the  functions  f{x,ip(x)}  and  f(x,y),  whose  forms  as  functions  of  x  are 
quite  different  from  one  another,  by  the  same  letter  /  in  df  /dx  and  df  /dx. 

4.  If  the  result  of  eliminating  t  between  x  =  f(t),  y  =  if(t)  is  f(x,y)  =  0, 
then 


df  dx  df  dy 
dx  dt  dy  dt 


5.  If  x  and  y  are  functions  of  t,  and  r  and  0  are  the  polar  coordinates  of 
(x,y),  then  r’  =  (xx1  +  yy’)/r,  9 '  =  {xy’  —  yx')/r2,  dashes  denoting  differentia¬ 
tions  with  respect  to  t. 


154.  The  Mean  Value  Theorem  for  functions  of  two  variables. 

Many  of  the  results  of  the  last  chapter  depended  upon  the  Mean  Value 
Theorem,  expressed  by  the  equation 

4>(x  +  h)  —  4>(x)  =  hf\x  +  9h), 

or  as  it  may  be  written,  if  y  —  4>(x), 

8y  =  f\x  +  9  Sx)  Sx. 

Now  suppose  that  z  =  f(x,y )  is  a  function  of  the  two  independent 
variables  x  and  y,  and  that  x  and  y  receive  increments  h,  k  or  5x,  8y 
respectively:  and  let  us  attempt  to  express  the  corresponding  increment 
of  z,  viz. 

Sz  =  f(x  +  h,  y  +  k)  -  f(x,  y), 
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in  terms  of  h,  k  and  the  derivatives  of  z  with  respect  to  x  and  y. 
Let  f(x  +  ht,y  +  kt )  =  F(t).  Then 

f{x  +  h,y  +  k)~  f{x,  y )  =  F(l)  -  F{ 0)  =  F'(9), 

where  0  <  9  <  1.  But,  by  §  153, 

F\t)  =  Dtf(x  +  ht,  y  +  kt) 

=  hfx(x  +  ht,  y  +  kt)  +  kfy(x  +  ht,  y  +  kt). 


Hence  finally 

Sz  =  f(x  +  h,y  +  k)  -  f(x,y)  =  hfx(x  +  9h,y  +  9k)  +  kfy(x  +  9h,y  +  9k), 

which  is  the  formula  desired.  Since  f'x,  /'  are  supposed  to  be  continuous 
functions  of  x  and  y,  we  have 

fxix  +  6h,y  +  9k)  =  fx(x,  y)  +  eKk, 
fy(x  +  9h,  y  +  9k)  =  fy{x,  y)  +  yh)k, 

where  ek,k  and  r)k,k  tend  to  zero  as  h  and  k  tend  to  zero.  Hence  the  theorem 
may  be  written  in  the  form 

=  ifx  + e) Sx  +  ify  +  v)  (!) 

where  e  and  y  are  small  when  Sx  and  5y  are  small. 

The  result  embodied  in  (1)  may  be  expressed  by  saying  that  the  equa¬ 
tion 

5z  =  fjx  +  fy5y 

is  approximately  true;  i.e.  that  the  difference  between  the  two  sides  of  the 
equation  is  small  in  comparison  with  the  larger  of  Sx  and  Sy.*  We  must  say 
Lthe  larger  of  Sx  and  Sy ’  because  one  of  them  might  be  small  in  comparison 
with  the  other;  we  might  indeed  have  Sx  =  0  or  Sy  =  0. 


Or  with  | (5a; |  +  \Sy\  or  \J Sx2  +  6y2. 
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It  should  be  observed  that  if  any  equation  of  the  form  5z  =  A  8x  +  p  5y  is 
‘approximately  true’  in  this  sense,  we  must  have  A  =  f'x,  p  =  f .  For  we  have 

8z  —  f'x5x  —  fySy  =  e5x  +  rj 5y ,  Sz  —  \8x  —  p8y  =  e'  8x  +  rj  8y 

where  e,  77,  e',  rj  all  tend  to  zero  as  8x  and  8y  tend  to  zero;  and  so 

(A  -  f'x)  8x  +  (p-  fy )  8y  =  p8x  +  p ’  8y 

where  p  and  p'  tend  to  zero.  Hence,  if  £  is  any  assigned  positive  number,  we  can 
choose  a  so  that 


I  (A  -  f'x )  8x  +  (p-  fy)  8y\  <  C(IH  +  |Ay|) 

for  all  values  of  8x  and  8y  numerically  less  than  a.  Taking  8y  =  0  we  obtain 
|  (A  —  fx)  8x |  <  C|<5a:| ,  or  |A  —  f'x\  <  (,  and,  as  £  may  be  as  small  as  we  please, 
this  can  only  be  the  case  if  A  =  fx.  Similarly  p  = 

155.  Differentials.  In  the  applications  of  the  Calculus,  especially  in 
geometry,  it  is  usually  most  convenient  to  work  with  equations  expressed 
not,  like  equation  (1)  of  §  154,  in  terms  of  the  increments  8x,  8y,  8z  of 
the  functions  x,  y,  z,  but  in  terms  of  what  are  called  their  differentials 
dx ,  dy,  dz. 

Let  us  return  for  a  moment  to  a  function  y  =  f(x)  of  a  single  variable  x. 
If  f'(x)  is  continuous  then 


5y  =  {f'(x)  +  e}6x,  (1) 

where  e  — »  0  as  8x  — >  0:  in  other  words  the  equation 

8y  =  f\x)8x  (2) 

is  ‘approximately’  true.  We  have  up  to  the  present  attributed  no  meaning 
of  any  kind  to  the  symbol  dy  standing  by  itself.  We  now  agree  to  define  dy 
by  the  equation 


dy  =  f\x)  8x. 


(3) 
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If  we  choose  for  y  the  particular  function  x,  we  obtain 


dx  =  bx, 


so  that 

dy  =  f(x)  dx. 

If  we  divide  both  sides  of  (5)  by  dx  we  obtain 


(4) 

(5) 

(6) 


where  dy/dx  denotes  not,  as  heretofore,  the  differential  coefficient  of  y,  but 
the  quotient  of  the  differentials  dy,  dx.  The  symbol  dy/dx  thus  acquires 
a  double  meaning;  but  there  is  no  inconvenience  in  this,  since  (6)  is  true 
whichever  meaning  we  choose. 

The  equation  (5)  has  two  apparent  advantages  over  (2).  It  is  exact  and  not 
merely  approximate,  and  its  truth  does  not  depend  on  any  assumption  as  to 
the  continuity  of  f'(x).  On  the  other  hand  it  is  precisely  the  fact  that  we  can, 
under  certain  conditions,  pass  from  the  exact  equation  (5)  to  the  approximate 
equation  (2),  which  gives  the  former  its  importance.  The  advantages  of  the  ‘dif¬ 
ferential’  notation  are  in  reality  of  a  purely  technical  character.  These  technical 
advantages  are  however  so  great,  especially  when  we  come  to  deal  with  functions 
of  several  variables,  that  the  use  of  the  notation  is  almost  inevitable. 

When  f(x)  is  continuous,  we  have 


Km  dl  =  1 
by 


when  bx  —>  0.  This  is  sometimes  expressed  by  saying  that  dy  is  the  principal 
part  of  by  when  bx  is  small,  just  as  we  might  say  that  ax  is  the  ‘principal  part’ 
of  ax  +  bx 2  when  x  is  small. 

We  pass  now  to  the  corresponding  definitions  connected  with  a  func¬ 
tion  z  of  two  independent  variables  x  and  y.  We  define  the  differential  dz 
by  the  equation 


dz  =  f'xSx  +  fy  by. 


(7) 
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Putting  z  =  x  and  z  —  y  in  turn,  we  obtain 

dx  =  Sx,  dy  =  5y,  (8) 


so  that 


dz  -  J",  dx  -  /'  dy, 


(9) 


which  is  the  exact  equation  corresponding  to  the  approximate  equation  (1) 
of  §  154.  Here  again  it  is  to  be  observed  that  the  former  is  of  importance 
only  for  reasons  of  practical  convenience  in  working  and  because  the  latter 
can  in  certain  circumstances  be  deduced  from  it. 

One  property  of  the  equation  (9)  deserves  special  remark.  We  saw  in  §  153 
that  if  z  =  f{x,y),  x  and  y  being  not  independent  but  functions  of  a  single 
variable  t,  so  that  z  is  also  a  function  of  t  alone,  then 

dz  df  dx  df  dy 
dt  dx  dt  dy  dt' 

Multiplying  this  equation  by  dt  and  observing  that 


dx 


dx 

dt 


dt, 


dy 


dz 


dz 

dt 


dt, 


we  obtain 

dz  =  f'x  dx  +  f'y  dy, 

which  is  the  same  in  form  as  (9).  Thus  the  formula  which  expresses  dz  in  terms 
of  dx  and  dy  is  the  same  whether  the  variables  x  and  y  are  independent  or  not. 
This  remark  is  of  great  importance  in  applications. 

It  should  also  be  observed  that  if  z  is  a  function  of  the  two  independent 
variables  x  and  y,  and 

dz  =  Xdx  +  y  dy, 

then  A  =  f'x,  y  =  f'y.  This  follows  at  once  from  the  last  paragraph  of  §  154. 

It  is  obvious  that  the  theorems  and  definitions  of  the  last  three  sections  are 
capable  of  immediate  extension  to  functions  of  any  number  of  variables. 
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Examples  LXII.  1.  The  area  of  an  ellipse  is  given  by  A  =  nab,  where 
a,  b  are  the  semiaxes.  Prove  that 

dA  da  db 
A  a  b  ’ 

and  state  the  corresponding  approximate  equation  connecting  the  increments  of 
the  axes  and  the  area. 

2.  Express  A,  the  area  of  a  triangle  ABC,  as  a  function  of  (i)  a,  B,  C, 
(ii)  A,  b,  c,  and  (iii)  a,  b,  c,  and  establish  the  formulae 


dA  da  c  dB  b  dC 

TV  =  ^ - '  7 — r  + 

A  a  a  sin  B 


dA  ,  ,  .  db  dc 

— —  =  cot  A  dA  +  — — | - , 

A  be 


a  sin  C  ’ 

dA  =  R( cos  A  da  +  cos  B  db  +  cos  C  dc) 


where  R  is  the  radius  of  the  circumcircle. 

3.  The  sides  of  a  triangle  vary  in  such  a  way  that  the  area  remains  constant, 
so  that  a  may  be  regarded  as  a  function  of  b  and  c.  Prove  that 

da  cos  B  da  cos  C 

db  cos  A  ’  dc  cos  A 

[This  follows  from  the  equations 

da  3d 

da  =  —  db  +  —  dc,  cos  A  da  +  cos  B  db  +  cos  C  dc  =  0.] 
db  dc 


4.  If  a,  b,  c  vary  so  that  R  remains  constant,  then 


da  db  dc 
cos  A  cos  B  cos  C 


=  0, 


and  so 

da  cos  A  da  cos  A 

db  cos  B '  dc  cos  C ' 

[Use  the  formulae  a  =  2 R  sin  A,  ... ,  and  the  facts  that  R  and  A  +  B  +  C  are 
constant.] 

5.  If  z  is  a  function  of  u  and  v,  which  are  functions  of  x  and  y,  then 


dz 

dx 


dz  du 
du  dx 


+ 


dz  dv 
dv  dx' 


dz 

dy 


dz  du 
du  dy 


+ 


dz  dv 
dv  dy 
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[We  have 


,  dz  dz 
dz  =  —  du  +  —  dv, 
du  dv 


,  du  ,  du  , 
du  =  —  dx  +  —  dy , 
ax  ay 


dv  =  —  dx  +  —  dy. 
ax  dy 


Substitute  for  du  and  dv  in  the  first  equation  and  compare  the  result  with  the 
equation 


dz 


dz 

dx 


dx  + 


dz 

dy 


dy.} 


6.  Let  z  be  a  function  of  x  and  y,  and  let  X,  Y,  Z  be  defined  by  the 
equations 


x  =  a\X  +  b\Y  +  c\Z ,  y  =  a2X  +  b-2Y  +  c2Z ,  z  =  a3V  +  b3Y  +  c3Z. 

Then  Z  may  be  expressed  as  a  function  of  X  and  Y.  Express  dZ/dX,  dZ/dY 
in  terms  of  dz/dx,  dz/dy.  [Let  these  differential  coefficients  be  denoted  by  P,  Q 
and  p.  q.  Then  dz  —  pdx  —  qdy  =  0,  or 


(crp  +  c2q  -  c3)  dZ  +  ( aip  +  a2q  -  03)  dX  +  (b\p  +  b2q  -  b3)  dY  =  0. 


Comparing  this  equation  with  dZ  —  P  dX  —  Q  dY  =  0  we  see  that 


aip  +  a2q  -  a3 
cip  +  c2q  -  c3  ’ 


Q  = 


bpp  +  b2q  -  b3 
crp  +  c2q  -  c3  ‘ 


7.  If 


(aix  +  b\y  +  c\z)p  +  (a2x  +  b2y  +  c2z)q  =  a3x  +  b3y  +  c3z, 


then 


(aiX  +  bxY  +  ci  Z)P  +  {a2X  +  b2Y  +  c2Z)Q  =  a3X  +  b3Y  +  c3Z. 

(Math.  Trip.  1899.) 

8.  Differentiation  of  implicit  functions.  Suppose  that  /(x,  y)  and  its 

derivative  /' (x,  y)  are  continuous  in  the  neighbourhood  of  the  point  (a,  b),  and 
that 


f(a,b)  =  0,  fb(a,b)^  0. 
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Then  we  can  find  a  neighbourhood  of  (a,  b )  throughout  which  fy(x,  y )  has  always 
the  same  sign.  Let  us  suppose,  for  example,  that  f'y(x.  y)  is  positive  near  (a,  b). 
Then  f(x,  y )  is,  for  any  value  of  x  sufficiently  near  to  a,  and  for  values  of  y 
sufficiently  near  to  b,  an  increasing  function  of  y  in  the  stricter  sense  of  §  95.  It 
follows,  by  the  theorem  of  §  108,  that  there  is  a  unique  continuous  function  y 
which  is  equal  to  b  when  x  =  a  and  which  satisfies  the  equation  f(x,  y)  =  0  for 
all  values  of  x  sufficiently  near  to  a. 

Let  us  now  suppose  that  f(x,y )  possesses  a  derivative  f'x{x,y)  which  is  also 
continuous  near  (a,  b).  If  f(x,  y)  =  0,  x  =  a  +  h,  y  =  b  +  k,  we  have 

0  =  f(x,  y)  -  /(a,  b)  =  (/'  +  e)h  +  (fb  +  rj)k, 

where  e  and  rj  tend  to  zero  with  h  and  k.  Thus 

k_  f'g  +  e  ft 

h  ft  +  V  ft' 

or 

dv  =  _ft 

dx  ft ' 

9.  The  equation  of  the  tangent  to  the  curve  f(x,  y)  =  0,  at  the  point  xq,  yo, 

is 

{x  -  x0)f'Xo(x0,yo )  +  (y~  m)fyo(x0,y0)  =  0. 


156.  Definite  Integrals  and  Areas.  It  will  be  remembered  that, 
in  Ch.  VI,  §  145,  we  assumed  that,  if  f(x)  is  a  continuous  function  of  x, 
and  PQ  is  the  graph  of  y  =  f(x),  then  the  region  PpqQ  shown  in  Fig.  47 
has  associated  with  it  a  definite  number  which  we  call  its  area.  It  is  clear 
that,  if  we  denote  Op  and  Oq  by  a  and  x,  and  allow  x  to  vary,  this  area  is 
a  function  of  x,  which  we  denote  by  F(x). 

Making  this  assumption,  we  proved  in  §  145  that  F'(x)  =  f(x),  and 
we  showed  how  this  result  might  be  used  in  the  calculation  of  the  areas  of 
particular  curves.  But  we  have  still  to  justify  the  fundamental  assumption 
that  there  is  such  a  number  as  the  area  F(x). 

We  know  indeed  what  is  meant  by  the  area  of  a  rectangle,  and  that  it 
is  measured  by  the  product  of  its  sides.  Also  the  properties  of  triangles, 
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parallelograms,  and  polygons  proved  by  Euclid  enable  us  to  attach  a  defi¬ 
nite  meaning  to  the  areas  of  such  figures.  But  nothing  which  we  know  so 
far  provides  us  with  a  direct  definition  of  the  area  of  a  figure  bounded  by 
curved  lines.  We  shall  now  show  how  to  give  a  definition  of  F(x)  which 
will  enable  us  to  prove  its  existence.* 

Let  us  suppose  fix)  continuous  throughout  the  interval  [a,  6],  and  let 
us  divide  up  the  interval  into  a  number  of  sub-intervals  by  means  of  the 
points  of  division  x0,  Xi,  x2,  . . . ,  xn,  where 

a  =  x0  <  x !<•••<  z„_i  <  xn  =  b. 

Further,  let  us  denote  by  5U  the  interval  [xu,xv+i\,  and  by  mu  the  lower 
bound  (§  102)  of  f(x)  in  Su,  and  let  us  write 

s  =  m060  +  mi5i  H - b  mn5n  = 

say. 

It  is  evident  that,  if  M  is  the  upper  bound  of  f(x)  in  [a,b],  then 
s  ^  M(b  —  a).  The  aggregate  of  values  of  s  is  therefore,  in  the  language  of 


*The  argument  which  follows  is  modelled  on  that  given  in  Goursat’s  Cours  d’ Analyse 
(second  edition),  vol.  i,  pp.  171  et  seq.;  but  Goursat’s  treatment  is  much  more  general. 
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§  80,  bounded  above,  and  possesses  an  upper  bound  which  we  will  denote 
by  j.  No  value  of  s  exceeds  j,  but  there  are  values  of  s  which  exceed  any 
number  less  than  j. 

In  the  same  way,  if  Mv  is  the  upper  bound  of  f{x)  in  5U,  we  can  define 
the  sum 

5  =  £MA. 

It  is  evident  that,  if  m  is  the  lower  bound  of  f(x)  in  [a,b\,  then 
S  ^  m(b  —  a).  The  aggregate  of  values  of  S  is  therefore  bounded  below, 
and  possesses  a  lower  bound  which  we  will  denote  by  J .  No  value  of  S 
is  less  than  J,  but  there  are  values  of  S  less  than  any  number  greater 
than  J. 

It  will  help  to  make  clear  the  significance  of  the  sums  s  and  S  if  we  observe 
that,  in  the  simple  case  in  which  f{x)  increases  steadily  from  x  =  a  to  x  =  b, 
mu  is  f{xv)  and  Mv  is  f{xu+ 1).  In  this  case  s  is  the  total  area  of  the  rectangles 
shaded  in  Fig.  48,  and  S  is  the  area  bounded  by  a  thick  line.  In  general  s  and  S 


will  still  be  areas,  composed  of  rectangles,  respectively  included  in  and  including 
the  curvilinear  region  whose  area  we  are  trying  to  define. 

We  shall  now  show  that  no  sum  such  as  s  can  exceed  any  sum  such 
as  S.  Let  s,  S  be  the  sums  corresponding  to  one  mode  of  subdivision,  and 
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s',  S'  those  corresponding  to  another.  We  have  to  show  that  s  ^  S'  and 
s’  SS. 

We  can  form  a  third  mode  of  subdivision  by  taking  as  dividing  points 
all  points  which  are  such  for  either  s,  S  or  s',  S'.  Let  s,  S  be  the  sums 
corresponding  to  this  third  mode  of  subdivision.  Then  it  is  easy  to  see  that 

s  ^  s,  s  v  s',  s^s,  s  <;  S".  (1) 

For  example,  s  differs  from  s  in  that  at  least  one  interval  5U  which  occurs 
in  s  is  divided  into  a  number  of  smaller  intervals 

V,  1  )  V.2  7  •  ■  ■  ) 

so  that  a  term  mv8v  of  s  is  replaced  in  s  by  a  sum 

T  mu^u,2  “h  *  *  * 

where  mVj i,  mVj 2,  . . .  are  the  lower  bounds  of  f(x)  in  8Vj  1,  8Uj 2,  •  •  •  •  But 
evidently  mv.  1  ^  mv,  m^2  =  mu ,  •  •  • ,  so  that  the  sum  just  written  is 
not  less  than  m V8V.  Hence  s  ^  s;  and  the  other  inequalities  (1)  can  be 
established  in  the  same  way.  But,  since  s  ^  S,  it  follows  that 

s  ^  s  <;  s  ^  s', 

which  is  what  we  wanted  to  prove. 

It  also  follows  that  j  V  J .  For  we  can  hnd  an  s  as  near  to  j  as  we 
please  and  an  S  as  near  to  J  as  we  please,*  and  so  j  >  J  would  involve  the 
existence  of  an  s  and  an  S  for  which  s  >  S. 

So  far  we  have  made  no  use  of  the  fact  that  f(x)  is  continuous.  We 
shall  now  show  that  j  =  J ,  and  that  the  sums  s,  S  tend  to  the  limit  J 
when  the  points  of  division  xu  are  multiplied  indefinitely  in  such  a  way 
that  all  the  intervals  5V  tend  to  zero.  More  precisely,  we  shall  show  that, 
given  any  positive  number  e,  it  is  possible  to  find  5  so  that 

0  J  —  s  <  e,  0  S  —  J  <  e 


The  s  and  the  S  do  not  in  general  correspond  to  the  same  mode  of  subdivision. 
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whenever  5U  <  5  for  all  values  of  u. 


There  is,  by  Theorem  II  of  §  106,  a  number  S  such  that 


Mv  —  mv  <  e/(b  —  a), 


whenever  every  5V  is  less  than  5.  Hence 


S  -  s  =  E(A  ~  mv)  <  e- 


But 


S-s={S-J)  +  {J-j)  +  (j~s); 


and  all  the  three  terms  on  the  right-hand  side  are  positive,  and  therefore 
all  less  than  e.  As  J  —  j  is  a  constant,  it  must  be  zero.  Hence  j  —  J  and 
O^j  —  s  <  e,  0  S[  S'  —  J  <  e,  as  was  to  be  proved. 

We  define  the  area  of  PpqQ  as  being  the  common  limit  of  s  and  S, 
that  is  to  say  J .  It  is  easy  to  give  a  more  general  form  to  this  definition. 
Consider  the  sum 


o’  =  E/A 


where  fu  denotes  the  value  of  f{x)  at  any  point  in  5U.  Then  a  plainly  lies 
between  s  and  S',  and  so  tends  to  the  limit  J  when  the  intervals  8V  tend  to 
zero.  We  may  therefore  define  the  area  as  the  limit  of  a. 

157.  The  definite  integral.  Let  us  now  suppose  that  f(x)  is  a 
continuous  function,  so  that  the  region  bounded  by  the  curve  y  =  f(x), 
the  ordinates  x  =  a  and  x  —  b,  and  the  axis  of  x,  has  a  definite  area.  We 
proved  in  Ch.  VI,  §  145,  that  if  F(x)  is  an  ‘integral  function’  of  f(x),  i.e.  if 


then  the  area  in  question  is  F(b)  —  F(a). 

As  it  is  not  always  practicable  actually  to  determine  the  form  of  F(x), 
it  is  convenient  to  have  a  formula  which  represents  the  area  PpqQ  and 
contains  no  explicit  reference  to  F(x).  We  shall  write 


b 
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The  expression  on  the  right-hand  side  of  this  equation  may  then  be 
regarded  as  being  defined  in  either  of  two  ways.  We  may  regard  it  as  simply 
an  abbreviation  for  F(b)  —  F(a ),  where  F(x)  is  some  integral  function 
of  f(x),  whether  an  actual  formula  expressing  it  is  known  or  not;  or  we 
may  regard  it  as  the  value  of  the  area  PpqQ,  as  directly  defined  in  §  156. 
The  number 

f(x)  dx 

is  called  a  definite  integral;  a  and  b  are  called  its  lower  and  upper 
limits;  f(x)  is  called  the  subject  of  integration  or  integrand;  and  the 

interval  [a,  b]  the  range  of  integration.  The  definite  integral  depends  on 
a  and  b  and  the  form  of  the  function  f(x)  only,  and  is  not  a  function  of  x. 
On  the  other  hand  the  integral  function 

F(x)  =  [  f(x)  dx 


is  sometimes  called  the  indefinite  integral  of  f(x). 

The  distinction  between  the  definite  and  the  indefinite  integral  is  merely  one 

rb 

of  point  of  view.  The  definite  integral  /  f(x)dx  =  F(b)  —  F(a)  is  a  function 

J  a 

of  b,  and  may  be  regarded  as  a  particular  integral  function  of  /(&).  On  the  other 
hand  the  indefinite  integral  F(x)  can  always  be  expressed  by  means  of  a  definite 
integral,  since 

F(x )  =  F(a)  +  f  f{t )  dt. 

J  a 

But  when  we  are  considering  ‘indefinite  integrals’  or  ‘integral  functions’  we 
are  usually  thinking  of  a  relation  between  two  functions,  in  virtue  of  which  one 
is  the  derivative  of  the  other.  And  when  we  are  considering  a  ‘definite  integral’ 
we  are  not  as  a  rule  concerned  with  any  possible  variation  of  the  limits.  Usually 
the  limits  are  constants  such  as  0  and  1;  and 


f1  f(x)dx  =  F(l)-F(0) 
J  o 


is  not  a  function  at  all,  but  a  mere  number. 
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rx 

It  should  be  observed  that  the  integral  /  f(t )  dt,  having  a  differential  co- 

J  a 

efficient  f(x),  is  a  fortiori  a  continuous  function  of  x. 

Since  1/x  is  continuous  for  all  positive  values  of  x,  the  investigations  of 
the  preceding  paragraphs  supply  us  with  a  proof  of  the  actual  existence  of  the 
function  logic,  which  we  agreed  to  assume  provisionally  in  §  128. 

158.  Area  of  a  sector  of  a  circle.  The  circular  functions.  The 

theory  of  the  trigonometrical  functions  cos  a;,  sin  a;,  etc.,  as  usually  pre¬ 
sented  in  text-books  of  elementary  trigonometry,  rests  on  an  unproved 
assumption.  An  angle  is  the  configuration  formed  by  two  straight  lines 
OA,  OP;  there  is  no  particular  difficulty  in  translating  this  ‘geometrical’ 
definition  into  purely  analytical  terms.  The  assumption  comes  at  the  next 
stage,  when  it  is  assumed  that  angles  are  capable  of  numerical  measure¬ 
ment,  that  is  to  say  that  there  is  a  real  number  x  associated  with  the  con- 


P 


figuration,  just  as  there  is  a  real  number  associated  with  the  region  PpqQ 
of  Fig.  47.  This  point  once  admitted,  cos  a;  and  sin  x  may  be  defined  in  the 
ordinary  way,  and  there  is  no  further  difficulty  of  principle  in  the  elabora¬ 
tion  of  the  theory.  The  whole  difficulty  lies  in  the  question,  what  is  the  x 
which  occurs  in  cos  x  and  sin  xl  To  answer  this  question,  we  must  define 
the  measure  of  an  angle,  and  we  are  now  in  a  position  to  do  so.  The  most 
natural  definition  would  be  this:  suppose  that  AP  is  an  arc  of  a  circle  whose 
centre  is  O  and  whose  radius  is  unity,  so  that  OA  =  OP  =  1.  Then  x,  the 
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measure  of  the  angle,  is  the  length  of  the  arc  AP.  This  is,  in  substance,  the 
definition  adopted  in  the  text-books,  in  the  accounts  which  they  give  of 
the  theory  of  ‘circular  measure’.  It  has  however,  for  our  present  purpose, 
a  fatal  defect;  for  we  have  not  proved  that  the  arc  of  a  curve,  even  of  a 
circle,  possesses  a  length.  The  notion  of  the  length  of  a  curve  is  capable 
of  precise  mathematical  analysis  just  as  much  as  that  of  an  area;  but  the 
analysis,  although  of  the  same  general  character  as  that  of  the  preceding 
sections,  is  decidedly  more  difficult,  and  it  is  impossible  that  we  should 
give  any  general  treatment  of  the  subject  here. 

We  must  therefore  found  our  definition  on  the  notion  not  of  length  but 
of  area.  We  define  the  measure  of  the  angle  AOP  as  twice  the  area  of  the 
sector  AOP  of  the  unit  circle. 

Suppose,  in  particular,  that  OA  is  y  =  0  and  that  OP  is  y  —  mx,  where 
m  >  0.  The  area  is  a  function  of  m,  which  we  may  denote  by  4>[m).  If  we 
write  /i  for  (1  +  m2)_2,  P  is  the  point  (/i, m/i),  and  we  have 


4>(m) 


\mn2  + 


—  x 2  dx. 


Differentiating  with  respect  to  m,  we  find 


4>'(rn) 


1 

2(1 +  m2)’ 


4>{m) 


rm  dt 
Jo  1  +  t2 


Thus  the  analytical  equivalent  of  our  definition  would  be  to  define  arc  tan  m 
by  the  equation 


dt 

arc  tan  m  =  /  - -; 

Jo  1  + 1 

and  the  whole  theory  of  the  circular  functions  could  be  worked  out  from 
this  starting  point,  just  as  the  theory  of  the  logarithm  is  worked  out  from 
a  similar  definition  in  Ch.  IX.  See  Appendix  III. 


Examples  LXIII.  Calculation  of  the  definite  from  the  indefinite 
integral.  1.  Show  that 


bn+i  _  an+ 1 


n  +  1 
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and  in  particular  that 


-I 


1 


/  xn  dx  = 

/  o  n  +  1 


2. 


3. 


cos  mx  dx  = 
dx 


sin  mb  —  sin  ma 


sin  mx  dx  = 


cos  ma  —  cos  mb 


m 


m 


=  arc  tan  b  —  arc  tan  a, 


dx 


1  +  x 


2  = 


J  a  1  T  X 

[There  is  an  apparent  difficulty  here  owing  to  the  fact  that  arc  tan  x  is  a 
many  valued  function.  The  difficulty  may  be  avoided  by  observing  that,  in  the 
equation 

fx  dt 

=  arc  tan  x, 


1  +  t2 


arc  tan  x  must  denote  an  angle  lying  between  —  |vr  and  r.  For  the  integral 
vanishes  when  x  =  0  and  increases  steadily  and  continuously  as  x  increases. 
Thus  the  same  is  true  of  arc  tan  x,  which  therefore  tends  to  ^ 7r  as  x  — >  oo.  In 
the  same  way  we  can  show  that  arc  tan  x  — >  —  as  x  — >  — oo.  Similarly,  in  the 
equation 

fx  dt 

/  ,  =  arc  sm  x, 

Jo 

where  —  1  <  x  <  1,  arc  sin  x  denotes  an  angle  lying  between  —  ^vr  and  \n.  Thus, 
if  a  and  b  are  both  numerically  less  than  unity,  we  have 


dx 


la  \/l  -  X: 


=  arc  sin  b  —  arc  sin  a.l 


4. 

5. 


dx 


1  —  x 


2ir 

371’ 


r*l 


dx 


TT 


dx 


I  o  1  +  X  +  X2 

a 


I o  1  +  2x  cos  a  +  x2  2  sin  a 


3V3' 

if  —  tt  <  a  <  7r,  except  when  a  =  0, 
when  the  value  of  the  integral  is  ^ ,  which  is  the  limit  of  cosec  a  as  a  — >  0. 

6.  f  \J \  —  x2  dx  =  j 7 r,  f  a2  —  x2  dx  =  \ira2  (a  >  0). 

Jo  "  Jo 

r ^ 

7.  /  - - -  =  .  ~ ,  if  a  >  1 61.  [For  the  form  of  the  indefinite 

Jo  a  +  6  cos  x  Va2  -  b2  L 

integral  see  Exs.  Lin.  3,  4.  If  |a|  <  |6|  then  the  subject  of  integration  has  an 
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infinity  between  0  and  n.  What  is  the  value  of  the  integral  when  a  is  negative 
and  —a  >  |6|?] 


47r 


dx 


IT 


-,2  i>no2  t"  -)_  5 2  sjn2  x  2 ab 


=  — — ,  if  a  and  b  are  positive.  What  is  the 


j  o  a  cos^  x  +  tr  sin  x 
value  of  the  integral  when  a  and  b  have  opposite  signs,  or  when  both  are  negative? 

9.  Fourier’s  integrals.  Prove  that  if  m  and  n  are  positive  integers  then 

r2ir 


cos  mx  sin  nx  dx 


10 


r2ir 


is  always  equal  to  zero,  and 

/»27T 

/  cosmx  cosnx  dx,  /  sin  mx  sin  nx  dx 

Jo  Jo 

are  equal  to  zero  unless  m  =  n,  when  each  is  equal  to  it. 

/*7T  /*7T 

10.  Prove  that  /  cos  mx  cos  nx  da;  and  /  sin  mx  sin  nx  dx  are  each  equal 

Jo  Jo 

to  zero  except  when  m  =  n,  when  each  is  equal  to  and  that 


cos  mx  sin  nx  dx  = 


2  n 


9  9  ’ 

nz  —  mz 


cos  mx  sin  nx  dx  =  0, 


according  as  n  —  m  is  odd  or  even. 


159.  Calculation  of  the  definite  integral  from  its  definition  as 
the  limit  of  a  sum.  In  a  few  cases  we  can  evaluate  a  definite  integral 
by  direct  calculation,  starting  from  the  definitions  of  §§  156  and  157.  As 
a  rule  it  is  much  simpler  to  use  the  indefinite  integral,  but  the  reader  will 
find  it  instructive  to  work  through  a  few  examples. 

[b 

Examples  LXIV.  1.  Evaluate  /  xdx  by  dividing  [a,  b]  into  n  equal 

J  a 

parts  by  the  points  of  division  a  =  xo,  xi,  X2,  . . . ,  xn  =  b,  and  calculating  the 
limit  as  n  — >  oo  of 


(xi  -  Xo)f(xo)  +  (x2  -  Xi)/(xi)  H - h(xn~  x„_i)/(xri_i). 
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[This  sum  is 
b  —  a 


n 

b  —  a 
n 


b  —  a\  (  b  —  a\  (  .  _,b  —  a 

a  +  (  a  H - )  +  (  o  +  2 -  )  +•••  +  <  a  +  {n  —  1) 


n 


n 


n 


na  +  - — -{1  +  2  H - b  (n  -  1)} 

n 


=  (b  —  a)  <  a  +  (b  —  a) 


n(n  —  1) 
2  n2 


which  tends  to  the  limit  ^(62  —  a2)  as  n  — >  oo.  Verify  the  result  by  graphical 
reasoning.] 

fb  2 

2.  Calculate  /  x  dx  in  the  same  way. 

J  a 

fb 

3.  Calculate  /  xdx,  where  0  <  a  <  b,  by  dividing  [ a,b ]  into  n  parts  by 

J  a 

the  points  of  division  a,  ar,  ar2,  ...,  arn~l ,  arn ,  where  rn  =  b/a.  Apply  the 

fb 

same  method  to  the  more  general  integral  /  xm  dx. 

J  a 

rb  rb 

4.  Calculate  /  cos  rrix  dx  and  /  sin  mx  dx  by  the  method  of  Ex.  1 . 

Ja  Ja 

n—1  \ 

5.  Prove  that  n  V  —5 - 77  — >  jir  as  n  — >  00. 

r= 0  n2  +  r 

[This  follows  from  the  fact  that 


n 

—  + 

n 


n 


2  +  l2 


+  •••  + 


n 


n—1 

E 


(1/n) 


n2  +  (n  —  l)2  ^  1  +  (r  In')2  ’ 

v  )  r=0  \  /  ) 


r ^ 

which  tends  to  the  limit  /  - ^  as  n  — >  00,  in  virtue  of  the  direct  definition 

Jo  1+x2 

of  the  integral.] 

\  n—1  _  /■!  - - 

6.  Prove  that  \/n2  —  r2  — >•  [The  limit  is  /  y  1  —  x2dx.] 

n  r= 0  Jo 


160.  General  properties  of  the  definite  integral.  The  definite 
integral  possesses  the  important  properties  expressed  by  the  following  equa- 
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tions.* 


(1) 


f(x)  dx  =  —  f{x)  dx. 


This  follows  at  once  from  the  definition  of  the  integral  by  means  of  the 
integral  function  F(x),  since  F(b)  —  F(a)  =  —{F(a )  —  F(b)}.  It  should  be 
observed  that  in  the  direct  definition  it  was  presupposed  that  the  upper 
limit  is  greater  than  the  lower;  thus  this  method  of  definition  does  not 

r 

apply  to  the  integral  /  f(x)  dx  when  a  <  b.  If  we  adopt  this  definition  as 

Jb 

fundamental  we  must  extend  it  to  such  cases  by  regarding  the  equation  (1) 
as  a  definition  of  its  right-hand  side. 


(2) 


f(x)  dx 


0. 


(3) 

(4) 

(5) 


f(x)dx+  /  f(x)dx—  /  f(x)dx. 


kf(x)dx  =  k  /  f{x)dx. 


{f(x)  +  4>(x)}  dx  =  /  f(x)dx+  /  4>(x)dx. 


The  reader  will  find  it  an  instructive  exercise  to  write  out  formal  proofs  of 
these  properties,  in  each  case  giving  a  proof  starting  from  (a)  the  definition  by 
means  of  the  integral  function  and  (/3)  the  direct  definition. 

The  following  theorems  are  also  important. 

(6)  If  f(x)  ^  0  when  a  ^  x  ^b,  then  /  f(x)  dx  ^  0. 

J  a 

We  have  only  to  observe  that  the  sum  s  of  §  156  cannot  be  negative.  It  will 
be  shown  later  (Misc.  Ex.  41)  that  the  value  of  the  integral  cannot  be  zero  unless 


*  All  functions  mentioned  in  these  equations  are  of  course  continuous,  as  the  definite 
integral  has  been  defined  for  continuous  functions  only. 
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f[x)  is  always  equal  to  zero:  this  may  also  be  deduced  from  the  second  corollary 
of  §  121. 

(7)  If  H  V  f(x)  V  K  when  a  5)  x  5)  b,  then 

H{b  —  a)ff  f  f(x)dx^K(h  —  a). 

J  a 

This  follows  at  once  if  we  apply  (6)  to  f(x)  —  H  and  K  —  f(x). 

(8)  f  f(x)dx  =  (b-  a)/(0, 

J  a 

where  f  lies  between  a  and  b. 

This  follows  from  (7).  For  we  can  take  H  to  be  the  least  and  K  the  greatest 
value  of  f(x)  in  [a,  b\.  Then  the  integral  is  equal  to  77(6  —  0),  where  77  lies  between 
H  and  K.  But,  since  f(x)  is  continuous,  there  must  be  a  value  of  f  for  which 

m  =  ri  (§  100). 

If  F(x)  is  the  integral  function,  we  can  write  the  result  of  (8)  in  the  form 
F(b)  —  F(a)  =  (6  —  a)F'(£), 

so  that  (8)  appears  now  to  be  only  another  way  of  stating  the  Mean  Value 
Theorem  of  §  125.  We  may  call  (8)  the  First  Mean  Value  Theorem  for 
Integrals. 

(9)  The  Generalised  Mean  Value  Theorem  for  integrals.  If 

(j>{x)  is  positive,  and  H  and  K  are  defined  as  in  (7),  then 

pb  pb  pb 

H  /  4>(x)  dxf^  f(x)4>(x)  dx  V  K  /  ffx)  dx ; 

J  a  J  a  J  a 


and 


f{x)f(x)  dx  =  /(0  /  <f>{x)  dx, 


where  f  is  defined  as  in  (8). 
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This  follows  at  once  by  applying  Theorem  (6)  to  the  integrals 


{. f(x )  -  H}4>(x )  dx, 


{K  —  f(x)}(j)(x)  dx. 


The  reader  should  formulate  for  himself  the  corresponding  result  which  holds 
when  (j>(x)  is  always  negative. 

(10)  The  Fundamental  Theorem  of  the  Integral  Calculus.  The 

function 

F(x)=  f  f  (t)  dt 

J  a 

has  a  derivative  equal  to  f(x). 

This  has  been  proved  already  in  §  145,  but  it  is  convenient  to  restate 
the  result  here  as  a  formal  theorem.  It  follows  as  a  corollary,  as  was  pointed 
out  in  §  157,  that  F(x)  is  a  continuous  function  of  x. 


Examples  LXV.  1.  Show,  by  means  of  the  direct  definition  of  the 
definite  integral,  and  equations  (l)-(5)  above,  that 


(i) 

(b) 


4>{x2)  dx  =  2  /  f(x2)  dx, 


x(f>(x2)  dx  =  0; 


0(cos  x)  dx  = 


r  2n 

/  <Msi 

Jo 


sin  x)  dx  =  \  /  ^(sin  x)  dx~, 


m 


/  f(cos2  x)  dx  =  m  /  4>(cos2  x)  dx, 

/  o  Jo 


m  being  an  integer.  [The  truth  of  these  equations  will  appear  geometrically 
intuitive,  if  the  graphs  of  the  functions  under  the  sign  of  integration  are  sketched.] 

sm  tlx 

2.  Prove  that  /  - dx  is  equal  to  it  or  to  0  according  as  n  is  odd  or  or 

Jo  sm  x 

even.  [Use  the  formula  (sinnx)/(sinx)  =  2  cos{(n  —  l)x}  +  2  cos{(n  —  3)x}  + . . . , 
the  last  term  being  1  or  2cosx.] 


3.  Prove  that  /  sin  nx  cot  x  dx  is  equal  to  0  or  to  tt  according  as  n  is  odd 

Jo 


or  even. 
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4.  If  4>(x)  =  ao  +  a\  cos  x  +  b\  sin  x  +  cos  2 x  +  ■  ■  ■  +  an  cos  nx  +  bn  sin  nx, 
and  k  is  a  positive  integer  not  greater  than  n,  then 

r2n  p2n  /*27t 

/  cf>(x)  dx  =  27rao,  /  cos  kxcj)(x)  dx  =  irak,  /  sin  kxj>(x)  dx  =  irbk- 

Jo  Jo  Jo 

If  k  >  n  then  the  value  of  each  of  the  last  two  integrals  is  zero.  [Use  Ex.  lxiii.  9.] 

5.  If  <J>{x)  =  ao  +  a\  cos  x  +  a2  cos  2 x  +  ■  ■  ■  +  an  cos  nx ,  and  A;  is  a  positive 
integer  not  greater  than  n,  then 


4>(x)  dx  =  7rao, 


cos  kx(j){x)  dx  =  \tx afc. 


If  k  >  n  then  the  value  of  the  last  integral  is  zero.  [Use  Ex.  lxiii.  10.] 
6.  Prove  that  if  a  and  b  are  positive  then 


r2n  dx _ 

Jo  a2  cos2  x  +  b2  sin2  x 


2vr 

ab 


[Use  Ex.  lxiii.  8  and  Ex.  1  above.] 

7.  If  f(x)  ^  ifi(x )  when  a  ^  x  ^b,  then 

8.  Prove  that 


0  < 

/  sinn+1  x  dx  < 

rh 

/  sin  nxdx, 

Jo 

Jo 

0  < 

/  tann+1  x  dx  < 

r  37r 

/  tann  x dx. 

Jo 

Jo 

9.*  If  n  >  1  then 


_  f  2  dx 
5  <  /  . 

Jo  V 1  —  x2n 


<  .524. 


[The  first  inequality  follows  from  the  fact  that  \/\  —  x2n  <  1,  the  second 
from  the  fact  that  \J\  —  x2n  >  \/l  —  x2.\ 


*Exs.  9-13  are  taken  from  Prof.  Gibson’s  Elementary  Treatise  on  the  Calculus. 
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10.  Prove  that 


3  < 


dx  i 

— ,  -  <  hir. 

V4  —  x2  +  x3 


11.  Prove  that  (3x  +  8)/16  <  1/ \/4  —  3x  +  x3  <  1/a/4  —  3x  if  0  <  x  <  1, 
and  hence  that 


19 

32 


< 


dx 


< 


12.  Prove  that 


.573  < 


'o  \/4  —  3x  +  x3  3 

r2  dx 


<  .595. 


/i  \/4  —  3x  +  x3 

[Put  x  =  1  +  u:  then  replace  2  +  3u2  +  u3  by  2  +  4u2  and  by  2  +  3n2.] 
13.  If  a  and  <f>  are  positive  acute  angles  then 


< 


f 

■Jo 


dx 


\/l  —  sin2  a  sin2  x 


< 


\/l  —  sin2  a  sin2  i 


If  a  =  (j)  =  i7r,  then  the  integral  lies  between  .523  and  .541. 


6 

14.  Prove  that 


f{x)  dx 


^  /  \f(x)\dx. 


[If  a  is  the  sum  considered  at  the  end  of  §  156,  and  a'  the  corresponding  sum 
formed  from  the  function  |/(x)|,  then  |<r|  ah] 

15.  If  |/(x)|  ^  M,  then 


/(x)0(x)  dx 


<  M 


(x)|  dx. 


161.  Integration  by  parts  and  by  substitution.  It  follows  from 

§  138  that 

[  f(x)<l>'(x)  dx  =  / (6)0(6)  -  /(a)0(a)  -  f  /'(x)0(x)  dx. 

J  a  J  a 

This  formula  is  known  as  the  formula  for  integration  of  a  definite  in¬ 
tegral  by  parts. 
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Again,  we  know  (§  133)  that  if  F(t)  is  the  integral  function  of  /(£),  then 


Hence,  if  0(a)  =  c,  0(6)  =  d,  we  have 


which  is  the  formula  for  the  transformation  of  a  definite  integral  by  sub¬ 
stitution. 

The  formulae  for  integration  by  parts  and  for  transformation  often  en¬ 
able  us  to  evaluate  a  definite  integral  without  the  labour  of  actually  find¬ 
ing  the  integral  function  of  the  subject  of  integration,  and  sometimes  even 
when  the  integral  function  cannot  be  found.  Some  instances  of  this  will 
be  found  in  the  following  examples.  That  the  value  of  a  definite  integral 
may  sometimes  be  found  without  a  knowledge  of  the  integral  function  is 
only  to  be  expected,  for  the  fact  that  we  cannot  determine  the  general 
form  of  a  function  F(x)  in  no  way  precludes  the  possibility  that  we  may  be 
able  to  determine  the  difference  F(b)  —  F(a)  between  two  of  its  particular 
values.  But  as  a  rule  this  can  only  be  effected  by  the  use  of  more  advanced 
methods  than  are  at  present  at  our  disposal. 

Examples  LX VI.  1.  Prove  that 


■b 


xf"{x )  dx  =  { bf(b )  -  /(&)}  -  {a/' (a)  -  /(a)}. 


2.  More  generally, 


■b 


x 


,m 


F(x)  =  xmf(m\x)  - 

+  m(m  —  l)xm~2  f(m~2)  (x)  —  •  ■  •  +  (— l)mm!  f(x). 


—  mx 


where 
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3.  Prove  that 


arc  sin  x  dx  =  —  1, 


'0 


x  axctanx  dx  = 


4.  Prove  that  if  a  and  b  are  positive  then 
/A 77  xcosxsinxdx 


7 T 


(a2  cos2  x  +  b2  sin2  x)2  4ab2(a  +  b ) 


[Integrate  by  parts  and  use  Ex.  lxiii.  8.] 

5.  If 

r*x  f*X  f*  X 

h(x)=  f(t)dt,  f2(x)=  fi(t)dt,  fk(x)=  fk-i(t)dt, 

Jo  Jo  Jo 

then 

fk(x)  =  JJ7^Yy_  fo  -  t)k~l  dt. 

[Integrate  repeatedly  by  parts.] 

6.  Prove  by  integration  by  parts  that  if 

Um,n  =  f  Xm(l  -  x)ndx, 

Jo 

where  rri  and  n  are  positive  integers,  then  [m  +  n  +  1  )um,n  =  1,  and 

deduce  that 

ml  nl 

Um,n  =  t  :  ,  \  i  * 

(m  +  n  +  lj! 


7.  Prove  that  if 


Urt.  — 


tan" x dx 


Jo 

then  un  +  un-2  =  l/(n  —  1).  Hence  evaluate  the  integral  for  all  positive  integral 
values  of  n. 

[Put  tan"  x  =  tan"-2  x(sec2  x  —  1)  and  integrate  by  parts.] 

8.  Deduce  from  the  last  example  that  un  lies  between  l/{2 (n  —  1)}  and 
l/{2(n  +  l)}. 
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9.  Prove  that  if 


^ n 


sin™  x  dx 


J  o 

then  un  =  {(n  —  1) /n}un-2-  [Write  sinn_1xsinx  for  sin"x  and  integrate  by 
parts.] 

10.  Deduce  that  un  is  equal  to 


2  •  4  •  6  . . .  (n  —  1)  X_1  -3-5...(ra-  1) 

3  •  5  •  7 . . .  n  ’  ^  2-4-6. ..n  ’ 

according  as  n  is  odd  or  even. 

11.  The  Second  Mean  Value  Theorem.  If  f(x)  is  a  function  of  x  which 
has  a  differential  coefficient  of  constant  sign  for  all  values  of  x  from  x  =  a  to 
x  =  b,  then  there  is  a  number  £  between  a  and  b  such  that 


rb  rb 

/  f(x)cj>(x)  dx  =  f{a )  /  4>(x)  dx  +  f(b)  /  (j>(x)  dx. 
/ a  J  a  J  £ 


[Let  /  cj>(t)  dt  =  <h(x).  Then 


»  rb  rb 

f(x)4>(x)  dx  =  /  f(x)<b'(x)dx  =  f(b)Q(b)—  /  f'(x)&(x)  dx 
J  a  J  a 


=  f(b)m-m  /  f(x)dx, 


by  the  generalised  Mean  Value  Theorem  of  §  160:  i.e. 

[  f{x)4>{x)  dx  =  f(b)$(b)  +  {/(a)  -  /(6)}4>(£), 


which  is  equivalent  to  the  result  given.] 

12.  Bonnet’s  form  of  the  Second  Mean  Value  Theorem.  If  f(x)  is 
of  constant  sign,  and  f(b)  and  f(a)  —  f{b)  have  the  same  sign,  then 
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where  X  lies  between  a  and  b.  [For  f(b)$>(b)  +  {f(a)  —  /(&)}<&(£)  =  /z/(a),  where 
H  lies  between  <J>(£)  and  <h(6),  and  so  is  the  value  of  4>(x)  for  a  value  of  x  such 
as  X.  The  important  case  is  that  in  which  0  ^  f(b)  ^  f(x)  ^  /(«)•] 

Prove  similarly  that  if  /(a)  and  f(b)  —  f{a )  have  the  same  sign,  then 

/  f{x)4>{x )  dx  =  f(b)  f  4>{x)  dx, 

Ja  JX 

fb 

where  X  lies  between  a  and  b.  [Use  the  function  T(^)  =  /  4>(x)dx.  It  will  be 

h 

found  that  the  integral  can  be  expressed  in  the  form 

/(«)*(«)  +  {/(&)- /(«)}*(£). 


The  important  case  is  that  in  which  0  ^  f(a)  5^  fix)  =  f(b)-] 


13.  Prove  that 


dx 


if  X'  >  X  >  0.  [Apply  the  first  formula  of  Ex.  12,  and  note  that  the  integral  of 
sinx  over  any  interval  whatever  is  numerically  less  than  2.] 

14.  Establish  the  results  of  Ex.  LXV.  1  by  means  of  the  rule  for  substitution. 
[In  (i)  divide  the  range  of  integration  into  the  two  parts  [— a,  0],  [0,  a],  and  put 
x  =  —y  in  the  first.  In  (ii)  use  the  substitution  x  =  \tt  —  y  to  obtain  the  first 
equation:  to  obtain  the  second  divide  the  range  [0, 7 r]  into  two  equal  parts  and 
use  the  substitution  x  =  ^ir  +  y.  In  (iii)  divide  the  range  into  m  equal  parts  and 
use  the  substitutions  x  =  ir  +  y,  x  =  2ir  +  y,  . . . .] 

15.  Prove  that 


F(x)  dx  =  /  F(a  +  b  —  x)  dx. 


16.  Prove  that 


cosm  x  sinm  xdx  =  2~m 


L 


cosm  x  dx. 


17.  Prove  that 


x^(sinx)  dx 


/*7T 

r  /  0(sinx)dx. 

Jo 
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[Put  x  =  7T  —  y.\ 

18.  Prove  that 


x  sm  x 


dx  =  4vr2. 


J o  1  +  cos2  X  4 

19.  Show  by  means  of  the  transformation  x  =  a  cos2  9  +  b  sin2  6  that 

rb 

/  y/ (x  —  a)(b  —  x)  dx  =  g7r (b  —  a)2 . 

J  a 

20.  Show  by  means  of  the  substitution  (a  +  b cos x) (a  —  b cosy)  =  a?  —  b2 
that 

/*  7T  /*7T 

/  (o  +  bcosx)~n  dx  =  (a2  —  62)Vn-5)  /  (a  —  frcosy)"^1  dy, 

Jo  Jo 

when  n  is  a  positive  integer  and  a  >  |6|,  and  evaluate  the  integral  when  n  =  1, 
2,  3. 

21.  If  m,  and  n  are  positive  integers  then 

777  I  77  ! 

/  {x-a)m(b-x)ndx  =  (b-a)m+n+1- - 

J  a  v™  i  “r  lj! 

[Put  x  =  a  +  (b  —  a)y,  and  use  Ex.  6.] 

162.  Proof  of  Taylor’s  Theorem  by  Integration  by  Parts.  We 

shall  now  give  the  alternative  form  of  the  proof  of  Taylor’s  Theorem  to 
which  we  alluded  in  §  147. 

Let  f(x)  be  a  function  whose  first  n  derivatives  are  continuous,  and  let 
Fn(x)  =  fib)  -  f{x )  -  {b  -  x)f  0) - ^(w_i)i 


Then 


and  so 


KM  - 


Fn{a)  =  Fn(b)  -  /  F'Jx)  dx  = 


Ja 


(b  —  x)n  1fl'n\x)dx. 
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If  now  we  write  a  +  h  for  b,  and  transform  the  integral  by  putting  x  =  a+th, 
we  obtain 


h 


71—1 


/(a  +  h)  —  f(a)  +  hf'(a )  +  •  •  •  +  - -tt/*"  ^(a)  +  Rni  (1) 


(n  —  1)! ' 


where 


/?  hK  I 

n  (n-l)\J0 


•  i 


(1  -t)n~iyn\a  +  th)dt. 


(2) 


Now,  if  p  is  any  positive  integer  not  greater  than  n,  we  have,  by  Theo¬ 
rem  (9)  of  §  160, 

[  (1  -t)n-1fH(a  +  th)dt=  f  [l- t)n~v {l- ty-1fn\a  +  th)dt 

Jo  Jo 

=  (1  -6)n-pf{n\a  +  6h)  [  (1  -tf-Ut, 

Jo 

where  0  <  6  <  1.  Hence 


Rn  = 


(1  -  ey~p yn\a  +  6h)hr 

p(n  —  1)! 


(3) 


If  we  take  p  =  n  we  obtain  Lagrange’s  form  of  Rn  (§  148).  If  on  the 
other  hand  we  take  p  =  1  we  obtain  Cauchy’s  form,  viz. 


Rn  = 


(1  -  ey-1  yn\a  +  0h)hr' 

(n  —  1)! 


(4) 


163.  Application  of  Cauchy’s  form  to  the  Binomial  Series.  If 

fix)  =  (1  +  x)m,  where  m  is  not  a  positive  integer,  then  Cauchy’s  form  of  the 
remainder  is 

m{m  —  1) . . .  (m  —  n  +  1)  (1  —  9)n~lxn 
"  “  1  ■  2. . .  (n  -  1)  (1  +  0x)n~m  ' 

*The  method  used  in  §  147  can  also  be  modified  so  as  to  obtain  these  alternative 
forms  of  the  remainder. 
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Now  (1  —  9)/{\  +  Ox)  is  less  than  unity,  so  long  as  —  1  <  x  <  1,  whether  x  is 
positive  or  negative;  and  (1  +  6x)m~l  is  less  than  a  constant  K  for  all  values 
of  n,  being  in  fact  less  than  (l  +  |x|)m_1  if  m  >  1  and  than  (1  —  |x|)m_1  if  m  <  1. 
Hence 


Rn |  <  I\\m 


I'm  —  1 
\n  —  1 


•E  |  —  Pm 


say.  But  pn  — >  0  as  n  — >  oo,  by  Ex.  xxvn.  13,  and  so  Rn  — >  0.  The  truth  of 
the  Binomial  Theorem  is  thus  established  for  all  rational  values  of  m  and  all 
values  of  x  between  —1  and  1.  It  will  be  remembered  that  the  difficulty  in  using 
Lagrange’s  form,  in  Ex.  LVI.  2,  arose  in  connection  with  negative  values  of  x. 


164.  Integrals  of  complex  functions  of  a  real  variable.  So  far  we 

have  always  supposed  that  the  subject  of  integration  in  a  definite  integral 
is  real.  We  define  the  integral  of  a  complex  function  f(x )  =  4>(x)  +  iil>(x) 
of  the  real  variable  x ,  between  the  limits  a  and  b,  by  the  equations 


f(x)dx=  /  {4>(x)  +  i'il(x)}  dx  =  /  4>(x)  dx  +  i  /  ^>{x)  dx; 


and  it  is  evident  that  the  properties  of  such  integrals  may  be  deduced  from 
those  of  the  real  integrals  already  considered. 

There  is  one  of  these  properties  that  we  shall  make  use  of  later  on.  It 
is  expressed  by  the  inequality 


f(x)  dx 


=  /  \f{x)\dx* 
J  a 


(i) 


This  inequality  may  be  deduced  without  difficulty  from  the  definitions  of 
§§  156  and  157.  If  8V  has  the  same  meaning  as  in  §  156,  and  i/y  are  the 
values  of  0  and  0  at  a  point  of  5V)  and  fu  =  (pu  +  iiftv,  then  we  have 


(f>dx  +  i 


lim  YjtPv  8V  +  i  lim  8V 

lim  +  H/v)  8V  =  lim  8U, 


The  corresponding  inequality  for  a  real  integral  was  proved  in  Ex.  LXV.  14. 
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and  so 

fdx=  |  lim  A|  =  lim  |EA  Su |; 

while 

|/|  dx  =  lim£|/„|  S„. 

The  result  now  follows  at  once  from  the  inequality 

IEAXI  =  EIAIX- 

It  is  evident  that  the  formulae  (1)  and  (2)  of  §  162  remain  true  when 
/  is  a  complex  function  </>  +  iip. 

MISCELLANEOUS  EXAMPLES  ON  CHAPTER  VII. 

1.  Verify  the  terms  given  of  the  following  Taylor’s  Series: 

(1)  tanx  =  x  +  |x3  +  ^x5  +  . . . , 

(2)  sec  x  =  1  +  \x2  +  Jjx4  +  . . . , 

(3)  x  cosec  x  =  1  +  |x2  +  g|gX4  +  . . . , 

(4)  x  cot  x  =  1  —  ^x2  —  jgx4  —  . . . . 

2.  Show  that  if  /(x)  and  its  first  n  +  2  derivatives  are  continuous,  and 
/(«+U  (0)  +  0,  and  6n  is  the  value  of  6  which  occurs  in  Lagrange’s  form  of  the 
remainder  after  n  terms  of  Taylor’s  Series,  then 

1  n  f/(n+2)(0)  1 

n  ~  n  +  1  +  2(n  +  l)2(n  +  2)  j  /("+!)  (0)  +  C* J  X ’ 

where  ex  — )•  0  as  x  — >•  0.  [Follow  the  method  of  Ex.  LV.  12.] 

3.  Verify  the  last  result  when  /(x)  =  l/(l  +  x).  [Here  (l  +  0nx)n+1  =  1  +  x.] 

4.  Show  that  if  /(x)  has  derivatives  of  the  first  three  orders  then 

f{b)  =  /(a)  +  \{b-  a){/'(a)  +  /'(&)}  -  ±{b  -  a)3/'"  (a ), 
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where  a  <  a  <  b.  [Apply  to  the  function 

f{x)  -  f(a )  -  \{x  -  a){f\a )  +  f\x)} 

~  I/(6)  _  /(a)  -  Ub  ~ 

arguments  similar  to  those  of  §  147.] 

5.  Show  that  under  the  same  conditions 

/0)  =  /(«)  +  (b-  a)f'{\(a  +  b)}  +  ±(b-  a)3/'"  (a). 

6.  Show  that  if  /(x)  has  derivatives  of  the  first  five  orders  then 

f{b)  =  f  (a)  +  \{b-  a) [/'(a)  +  /'(&)  +4/'{^(a  +  b)}]  -  ^m{b  -  af  f[b) {a). 

7.  Show  that  under  the  same  conditions 

f(b)  =  f(a)+\(b-a){f  (a)+ /'  (6)}-I^(6-a)2{/,/  (b)- /"  (a)}+^(6-a)5 /(5)  (a). 


8.  Establish  the  formulae 


(i) 


/(«)  /(&) 

5(a)  5(6) 


(b~  a) 


/(a)  /'(/*) 

5(a)  5'(/?)  ’ 


where  /3  lies  between  a  and  6,  and 


/(a) 

m 

/(c) 

/(a) 

m 

n  7) 

(ii) 

5(a) 

g{b) 

5(c) 

=  \{b-c)(c-  a) (a 

-6) 

5(a) 

m 

5"  (7) 

h{a) 

h(b) 

h{c) 

h(a) 

h'iP) 

V'(7) 

where  /3  and  7  lie  between  the  least  and  greatest  of  a,  b,  c.  [To  prove  (ii)  consider 
the  function 


/(a)  f{b)  /(x) 
5(a)  g{b)  g{x) 
h(a)  h(b )  h(x) 


(x  —  a)(x  —  b) 
(c  —  a)(c  —  5) 


/(a)  /(&)  /(c) 
5(a)  5(6)  5(c)  , 

h{a)  h(b)  h(c) 


which  vanishes  when  x  =  a,  x  =  6,  and  x  =  c.  Its  first  derivative,  by  Theorem  B 
of  §  121,  must  vanish  for  two  distinct  values  of  x  lying  between  the  least  and 
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greatest  of  a,  b,  c;  and  its  second  derivative  must  therefore  vanish  for  a  value  7 
of  x  satisfying  the  same  condition.  We  thus  obtain  the  formula 


/(a) 

m 

/(c) 

/(«) 

m 

/"( 7) 

5(o ) 

9(b) 

9(c) 

=  Uc~a)(c 

- b ) 

9(a) 

9(b) 

9"  (7) 

h(a) 

Kb) 

h(c) 

h(a) 

Kb) 

h"(  7) 

The  reader  will  now  complete  the  proof  without  difficulty.] 

9.  If  F(x)  is  a  function  which  has  continuous  derivatives  of  the  first  n  orders, 
of  which  the  first  n— 1  vanish  when  x  =  0,  and  A  5^  F^n\x)  A  B  when  0  A  x  ^  h, 
then  A(xn/n\)  fs  F(x)  A  B(xn/n\ )  when  0  A  x  ^  h. 

Apply  this  result  to 

Tn—  1 

/M  *  /(o)  -  xf(  0) - pjj/l”_1)(0), 

and  deduce  Taylor’s  Theorem. 

10.  If  A h<l>(x)  =  4>(x)  —  cf>(x  +  h),  A^(f>(x)  =  A/j{A h^>{x]},  and  so  on,  and 
4>(x)  has  derivatives  of  the  first  n  orders,  then 

A^(x)  =  )r(n)<t>(x  +  rh)  =  (-/r)>M(0, 

r= 0  'v?  ' 

where  £  lies  between  x  and  x  +  nh.  Deduce  that  if  Kn\x)  is  continuous  then 
{A ^(j)(x)}/hn  (— l)nKn\x)  as  h  — >  0.  [This  result  has  been  stated  already 

when  n  =  2,  in  Ex.  LV.  13.] 

11.  Deduce  from  Ex.  10  that  xn~m  A^xm  m(m  —  1) . . .  (m  —  n  +  l)hn  as 
x  — y  00,  m  being  any  rational  number  and  n  any  positive  integer.  In  particular 
prove  that 

x\[x{yfx  —  2y/x  +  1  +  Vx  +  2}  ->  — 

12.  Suppose  that  y  =  (j)(x)  is  a  function  of  x  with  continuous  derivatives  of 
at  least  the  first  four  orders,  and  that  7(0)  =  0,  (f>'(0)  =  1,  so  that 

y  =  4>{x)  =  x  +  a2x2  +  a3x3  +  (a4  +  ex)x 4, 
where  ex  — >  0  as  x  — >  0.  Establish  the  formula 

x  =  ip(y)  =  y  -  a2y 2  +  (2 a\  -  a3)y3  -  (5a3  -  5a2a3  +  a4  +  ey)y‘i , 
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where  ey  — >  0  as  y  — >  0,  for  that  value  of  x  which  vanishes  with  y,  and  prove 
that 

4>(x)i/j(x)  -x2  2 

I  >  a2 

X 4 

as  x  — >  0. 

13.  The  coordinates  (£,77)  of  the  centre  of  curvature  of  the  curve  x  =  /(£), 
y  =  F(t),  at  the  point  (x,y),  are  given  by 

-(£  -  =  (7  -  y)/x'  =  ( x '2  +  y,2)/(x'y"  -  x"y'); 

and  the  radius  of  curvature  of  the  curve  is 

(*'2  +  y,2)3/2/(x’y"  -  x'V). 

dashes  denoting  differentiations  with  respect  to  t. 

14.  The  coordinates  (£,  77)  of  the  centre  of  curvature  of  the  curve  27 ay2  =  4x3, 
at  the  point  (x,y),  are  given  by 

3  a(£  +  x)  +  2x2  =  0,  77  =  4y  +  (9  ay)/x. 


(Math.  Trip.  1899.) 

15.  Prove  that  the  circle  of  curvature  at  a  point  (x,  y)  will  have  contact  of 
the  third  order  with  the  curve  if  (1  +  y2)yj,  =  37/17/2  that  point.  Prove  also 
that  the  circle  is  the  only  curve  which  possesses  this  property  at  every  point;  and 
that  the  only  points  on  a  conic  which  possess  the  property  are  the  extremities 
of  the  axes.  [Cf.  Ch.  VI,  Misc.  Ex.  10  (iv).] 

16.  The  conic  of  closest  contact  with  the  curve  y  =  ax2+bx3+cx4+ ■  ■  ■ +kxn , 
at  the  origin,  is  a3y  =  aAx2  +a2bxy+(ac—h2)y2 .  Deduce  that  the  conic  of  closest 
contact  at  the  point  (£,77)  of  the  curve  y  =  f(x)  is 

I8772T  =  9??2(z  -  £)2  +  67/2 773(3:  -  £)T  +  (3772774  -  4t/|)T2, 

where  T  =  (77  —  77)  —  77 \(x  —  £).  (Math.  Trip.  1907.) 

17.  Homogeneous  functions.*  If  u  =  xn  f(y /x,  z/x, . . . )  then  u  is  unal¬ 
tered,  save  for  a  factor  \n,  when  x,  y,  z,  . . .  are  all  increased  in  the  ratio  A  :  1. 

*In  this  and  the  following  examples  the  reader  is  to  assume  the  continuity  of  all  the 
derivatives  which  occur. 
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In  these  circumstances  u  is  called  a  homogeneous  function  of  degree  n  in  the 
variables  x,  y,  z,  . . . .  Prove  that  if  u  is  homogeneous  and  of  degree  n  then 


du  du 


du 


- =nu. 

ox  ay  dz 


This  result  is  known  as  Euler’s  Theorem  on  homogeneous  functions. 

18.  If  u  is  homogeneous  and  of  degree  n  then  du/dx,  du/dy,  ...  are  homo¬ 
geneous  and  of  degree  n  —  1 . 

19.  Let  f(x,y)  =  0  be  an  equation  in  x  and  y  ( e.g .  xn  +  yn  —  x  =  0), 
and  let  F(x,y,z )  =  0  be  the  form  it  assumes  when  made  homogeneous  by  the 
introduction  of  a  third  variable  z  in  place  of  unity  (e.g.  xn  +  yn  —  xz n_1  =  0). 
Show  that  the  equation  of  the  tangent  at  the  point  (£,  rj)  of  the  curve  f(x,  y)  =  0 

is 

xF^  +  yFv  +  zF^  =  0, 

where  F^.  Fri,  F^  denote  the  values  of  Fx,  Fy,  Fz  when  x  =  £,  y  =  y,  z  =  £  =  1. 

20.  Dependent  and  independent  functions.  Jacobians  or  functional 
determinants.  Suppose  that  u  and  v  are  functions  of  x  and  y  connected  by  an 
identical  relation 

(j)(u,v)  =  0.  (1) 

Differentiating  (1)  with  respect  to  x  and  y,  we  obtain 


d <j)  du  dcj)  dv  dcj)  du  df  dv 

du  dx  dv  dx  ’  du  dy  dv  dy 


(2) 


and,  eliminating  the  derivatives  of  </>, 


V'X 

Vx 


Uh 


-  UxVy  UyVx  —  0, 


(3) 


where  ux,  uy,  vx,  vy  are  the  derivatives  of  u  and  v  with  respect  to  x  and  y. 
This  condition  is  therefore  necessary  for  the  existence  of  a  relation  such  as  (1). 
It  can  be  proved  that  the  condition  is  also  sufficient ;  for  this  we  must  refer  to 
Goursat’s  Cours  d’  Analyse,  vol.  i,  pp.  125  et  seq. 

Two  functions  u  and  v  are  said  to  be  dependent  or  independent  according 
as  they  are  or  are  not  connected  by  such  a  relation  as  (1).  It  is  usual  to  call  J 
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the  Jacobian  or  functional  determinant  of  u  and  v  with  respect  to  x  and  y,  and 
to  write 

=  d (u,v) 
d(x,  y) ' 

Similar  results  hold  for  functions  of  any  number  of  variables.  Thus  three 
functions  u,  v,  w  of  three  variables  x,  y ,  z  are  or  are  not  connected  by  a  relation 
4>(u,v,  w)  =  0  according  as 


J  = 


^ X 

Vx 

Wx 


Uy  UZ 
Vy  VZ 
Wy  WZ 


d(u,  v,  w) 
d(x,y,z ) 


does  or  does  not  vanish  for  all  values  of  x,  y,  z. 

21.  Show  that  ax2  +  2 hxy  +  by 2  and  Ax2  +  2 Hxy  +  By2  are  independent 
unless  a/ A  =  h/H  =  b/B. 

22.  Show  that  ax 2  +  by2  +  cz 2  +  2 fyz  +  2gzx  +  2 hxy  can  be  expressed  as  a 
product  of  two  linear  functions  of  x,  y,  and  z  if  and  only  if 

abc  +  2  fgh  —  af2  —  bg 2  —  ch2  =  0. 


[Write  down  the  condition  that  px  +  qy  +  rz  and  p'x  +  q’y  +  r'z  should  be 
connected  with  the  given  function  by  a  functional  relation.] 

23.  If  u  and  v  are  functions  of  £  and  77,  which  are  themselves  functions  of 
x  and  y,  then 

d(u,v)  _  d(u,v)  d(£,rj) 
d(x,y)  d(£,r))  d{x,y)' 

Extend  the  result  to  any  number  of  variables. 

24.  Let  f(x)  be  a  function  of  x  whose  derivative  is  1/x  and  which  vanishes 
when  x  =  1.  Show  that  if  u  =  f(x)  +  f(y),  v  =  xy,  then  uxvy  —  uyvx  =  0,  and 
hence  that  u  and  v  are  connected  by  a  functional  relation.  By  putting  y  =  1, 
show  that  this  relation  must  be  f{x)  +  f(y)  =  f(xy).  Prove  in  a  similar  manner 
that  if  the  derivative  of  f(x)  is  1/(1  +  x2),  and  /( 0)  =  0,  then  f(x)  must  satisfy 
the  equation 

x  +  y  \ 


f(x)  +  f{y)  =  f 


1  —  xy 
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25.  Prove  that  if  f(x)  = 

Jo 


dt 


then 


f{x)+  f{y)  =  f 


o  Vl  -  t 4 

x^/l  -  yA  +  y\J\  -  x4 


1  +  x2y2 


26.  Show  that  if  a  functional  relation  exists  between 

u  =  f(x)+f(y)+f(z),  v  =  f(y)f(z)+f(z)f(x)+f(x)f(y),  w  =  f (x) f (y) / (z) , 

then  /  must  be  a  constant.  [The  condition  for  a  functional  relation  will  be  found 
to  be 


f'(x)f'(y)f'(z){f(y)  -  f(z)}{f(z)  -  f(x)}{f(x)  -  f(y)}  =  0.] 

27.  If  f(y,  z),  f(z,  x),  and  /(x,  y)  are  connected  by  a  functional  relation  then 

f(x,x)  is  independent  of  x.  (Math.  Trip.  1909.) 

28.  If  u  =  0,  v  =  0,  w  =  0  are  the  equations  of  three  circles,  rendered 
homogeneous  as  in  Ex.  19,  then  the  equation 

d(u,v,w)  = 
d(x,y,z ) 

represents  the  circle  which  cuts  them  all  orthogonally.  (Math.  Trip.  1900.) 

29.  If  A,  B,  C  are  three  functions  of  x  such  that 

A  A'  A" 

B  B'  B" 

C  C'  C” 

vanishes  identically,  then  we  can  find  constants  A,  y,  v  such  that  A  A  +  yB  +  vC 
vanishes  identically;  and  conversely.  [The  converse  is  almost  obvious.  To  prove 
the  direct  theorem  let  a  =  BC  —  B'C ,  ....  Then  a'  =  BC"  —  B" C ,  . . . ,  and 
it  follows  from  the  vanishing  of  the  determinant  that  (3^'  —  /Tq  =  0,  . . . ;  and  so 
that  the  ratios  a  :  (3  :  7  are  constant.  But  aA  +  f3B  +  7 C  =  0.] 

30.  Suppose  that  three  variables  x,  y ,  z  are  connected  by  a  relation  in  virtue 
of  which  (i)  z  is  a  function  of  x  and  y,  with  derivatives  zx,  zy,  and  (ii)  x  is  a 
function  of  y  and  z,  with  derivatives  xy,  xz.  Prove  that 
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[We  have 


dz  =  zxdx  +  zy  dy ,  dx  =  xy  dy  +  dz. 
The  result  of  substituting  for  dx  in  the  first  equation  is 


dz  =  ( zxXy  +  zy)  dy  +  zxxz  dz , 

which  can  be  true  only  if  zxxy  +  zy  =  0,  zxxz  =  1.] 

31.  Four  variables  x,  y,  z,  u  are  connected  by  two  relations  in  virtue  of  which 
any  two  can  be  expressed  as  functions  of  the  others.  Show  that 


Vzzxxy  =  ~yXzzVxxl  =  Xzzx  +  Vzzv  =  1 


z"x  1  tfz  y 


where  y"  denotes  the  derivative  of  y,  when  expressed  as  a  function  of  z  and  u, 
with  respect  to  z.  (Math.  Trip.  1897.) 

32.  Find  A,  B,  C,  X  so  that  the  first  four  derivatives  of 


pa- \-x 

/  f(t)  dt  —  x[Af(a)  +  Bf(a  +  Ax)  +  C f(a  +  x)] 

J  a 

vanish  when  x  =  0;  and  A,  B,  C,  D ,  A,  p  so  that  the  first  six  derivatives  of 

pa+x 

/  f(t)  dt  —  x[Af(a)  +  Bf(a  +  Ax)  +  Cf(a  +  px)  +  Df(a  +  x)] 

J  a 

vanish  when  x  =  0. 

33.  If  a  >  0,  ac  —  b2  >  0,  and  xi  >  xq,  then 


'x0 


Xl  dx  _ 

ax2  +  2hx  +  c  yjac  -  b 2 


1  J  (xi  -  x0)Vac  -  b 2 

arc  tan  ■ 


axjxo  +  b(x  i  +  xq)  +  c 


the  inverse  tangent  lying  between  0  and  n.* 

/i  sm  ex  dx 

- For  what  values  of  a  is  the 

_i  1  —  2x  cos  a  +  xz 

integral  a  discontinuous  function  of  a? 


(Math.  Trip.  1904.) 


*In  connection  with  Exs.  33-35,  38,  and  40  see  a  paper  by  Dr  Bromwich  in  vol.  xxxv 
of  the  Messenger  of  Mathematics. 
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[The  value  of  the  integral  is  \i r  if  2?r7r  <  a  <  (2ra  +  l)7r,  and  —  if 

(2 n  —  l)7r  <  a  <  2n7r,  n  being  any  integer;  and  0  if  a  is  a  multiple  of  7 r.] 

35.  If  ax 2  +  2 bx  +  c  >  0  when  xq  ^  x  ^  x\,  f(x)  =  V ax2  +  2 bx  +  c,  and 


V  =  f{x ),  y0  =  f(x0),  yi  =  f(xi),  X  =  (xi  -  x0)/(y1  +  y0), 


then 


rXl  dx  _  1  1  +  X^  -2 

Lo  y  \/a  °S  1  -  \/— a 


arctan{V  \/— a}, 


according  as  a  is  positive  or  negative.  In  the  latter  case  the  inverse  tangent  lies 


between  0  and  ^7r.  [It  will  be  found  that  the  substitution  t  = 


rX 


x  —  XQ 

y  +  yo 


reduces 


the  integral  to  the  form  2 
36.  Prove  that 


dt 


1  —  at2' 


fa  dx 
I o  x  +  Va2  —  x2 


=  4^ 


(Math.  Trip.  1913.) 


37.  If  a  >  1  then 


r1  Vi  x 2  dx  =  _  ^  _  1} 

J-i  a-  x 


38.  If  p  >  1,  0  <  q  <  1,  then 

I"1  dx  2u 

Jo  vTl  +  (p2  -  1)*}{1  -  (1  -  <?2).x}  (p  +  <?)sinu;’ 

where  cj  is  the  positive  acute  angle  whose  cosine  is  (1  +  pq)/(p  +  q )• 

39.  If  a  >  b  >  0,  then 


sin2  9  d9 
a  —  b  cos  9 


(Math.  Trip.  1904.) 
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40.  Prove  that  if  a  >  y/b2  +  c2  then 


d6  2  J  V a2  —  b2  —  c2 

- ; - ^ ^  =  ,  arc  tan  <  - 

a  +  b  cos  6  +  c  sm  6  yja2  —  b2  —  c2  )  c 

the  inverse  tangent  lying  between  0  and  tt. 

[b 

41.  If  f(x)  is  continuous  and  never  negative,  and  /  f(x)dx  =  0,  then 

J  a 

f(x)  =  0  for  all  values  of  x  between  a  and  b.  [If  f(x)  were  equal  to  a  positive 
number  k  when  x  =  £,  say,  then  we  could,  in  virtue  of  the  continuity  of  f(x), 
find  an  interval  [£  —  <5,  £  +  5]  throughout  which  f(x)  >  \ k ;  and  then  the  value  of 
the  integral  would  be  greater  than  5k.] 

42.  Schwarz’s  inequality  for  integrals.  Prove  that 

(pb  \2  pb  pb 

/  (fn/jdx  ]  S  (j)2dx  il)2  dx. 

J  a  /  J  a  J  a 


[Use  the  definitions  of  §§  156  and  157,  and  the  inequality 


<V)2  =  E02 


(Ch.  I,  Misc.  Ex.  10).] 

43.  If 

then  Pn(x)  is  a  polynomial  of  degree  n,  which  possesses  the  property  that 


Pn{x)9{x)  dx  =  0 


if  d(x)  is  any  polynomial  of  degree  less  than  n.  [Integrate  by  parts  m  +  1  times, 
where  m  is  the  degree  of  9(x),  and  observe  that  9^m+1\x)  =  0.] 

44.  Prove  that 


rP 


Pm (x)Pn  (x)  dx  =  0 


if  m  /  n,  but  that  if  m  =  n  then  the  value  of  the  integral  is  (/3  —  a)/(2n  +  1). 
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45.  If  Qn(x )  is  a  polynomial  of  degree  n,  which  possesses  the  property  that 


dx  =  0 


if  0(x)  is  any  polynomial  of  degree  less  than  n,  then  Qn(x)  is  a  constant  multiple 
of  Pn(x). 

[We  can  choose  k  so  that  Qn  —  kPu  is  of  degree  n  —  1:  then 
rP  rP 

I  Qn  (Qn  P} l )  dx  —  0,  I  Pn(Qn  I^Pn)  dx  =  0, 

J  a  J  a 


and  so 


kPu )2  dx  =  0. 


Now  apply  Ex.  41.] 

46.  Approximate  Values  of  definite  integrals.  Show  that  the  error  in 

fb 

taking  \(b  —  a){4>(a)  +  4>(b)}  as  the  value  of  the  integral  /  <j)(x)  dx  is  less  than 

J  a 

j^M(b  —  a)3,  where  M  is  the  maximum  of  \(j)"{x)\  in  the  interval  [a,  b];  and 
that  the  error  in  taking  (b  —  a)cf){\{a  +  b )}  is  less  than  V M(b  —  a)3.  [Write 
f{x)  =  4>(x)  in  Exs.  4  and  5.]  Show  that  the  error  in  taking 


gO  -  a)\(j){a)  +  4>(b)  +  4 4>{\{a  +  b)}} 

as  the  value  is  less  than  ^ ]^jM(b  —  a)5,  where  M  is  the  maximum  of  x ). 

[Use  Ex.  6.  This  rule,  which  gives  a  very  good  approximation,  is  known  as 
Simpson’s  Rule.  It  amounts  to  taking  one-third  of  the  first  approximation 
given  above  and  two-thirds  of  the  second.] 

Show  that  the  approximation  assigned  by  Simpson’s  Rule  is  the  area  bounded 
by  the  lines  x  =  a,  x  =  b,  y  =  0,  and  a  parabola  with  its  axis  parallel  to  OY 
and  passing  through  the  three  points  on  the  curve  y  =  (j>(x)  whose  abscissae  are 
a,  4(a  +  b),  b. 

It  should  be  observed  that  if  <f>(x)  is  any  cubic  polynomial  then  <f>P\x)  =  0, 
and  Simpson’s  Rule  is  exact.  That  is  to  say,  given  three  points  whose  abscissae 
are  a,  ^(a  +  b),  b ,  we  can  draw  through  them  an  infinity  of  curves  of  the  type 
y  =  a  +  f3x  +  72 2  +  Ax3;  and  all  such  curves  give  the  same  area.  For  one  curve 
S  =  0,  and  this  curve  is  a  parabola. 
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47.  If  4>(x)  is  a  polynomial  of  the  fifth  degree,  then 

[  <j>(x)dx  =  ±{5cj)(a)  +  8(f)(1)  +  50(/3)}, 

Jo 

a  and  /3  being  the  roots  of  the  equation  x2  —  x  +  ^  =  0.  (Math.  Trip.  1909.) 

48.  Apply  Simpson’s  Rule  to  the  calculation  of  it  from  the  formula 
/*!  dx 

\i r  =  /  - - o'  [The  result  is  .7833....  If  we  divide  the  integral  into  two, 

Jo  1  +  x 

from  0  to  |  and  )  to  1,  and  apply  Simpson’s  Rule  to  the  two  integrals 
separately,  we  obtain  .785  391  6  . . . .  The  correct  value  is  .785  398  1 . . . .] 

49.  Show  that 


8.9  < 


4  +  x2  dx  <  9. 


50.  Calculate  the  integrals 
f1  dx  f1  c 

Jo  1  +  x  ’  Jo  VJJ 


^sin  x  dx , 


(Math.  Trip.  1903.) 


to  x 


to  two  places  of  decimals.  [In  the  last  integral  the  subject  of  integration  is  not 
defined  when  x  =  0:  but  if  we  assign  to  it,  when  x  =  0,  the  value  1,  it  becomes 
continuous  throughout  the  range  of  integration.] 


CHAPTER  VIII 


THE  CONVERGENCE  OF  INFINITE  SERIES  AND 
INFINITE  INTEGRALS 

165.  In  Ch.  IV  we  explained  what  was  meant  by  saying  that  an  infi¬ 
nite  series  is  convergent ,  divergent ,  or  oscillatory ,  and  illustrated  our  def¬ 
initions  by  a  few  simple  examples,  mainly  derived  from  the  geometrical 
series 

1  +  x  +  x2  +  . . . 

and  other  series  closely  connected  with  it.  In  this  chapter  we  shall  pursue 
the  subject  in  a  more  systematic  manner,  and  prove  a  number  of  theorems 
which  enable  us  to  determine  when  the  simplest  series  which  commonly 
occur  in  analysis  are  convergent. 

We  shall  often  use  the  notation 

n 

n m  T  ,ur7l_)_i  T  ■  ■  ■  T  un  ^  '  0(^)5 


00 

and  write  Y  um  or  simply  Y  un,  for  the  infinite  series  Uq  +  u±  +  U2  +  . . .  .* 
0 

166.  Series  of  Positive  Terms.  The  theory  of  the  convergence  of 
series  is  comparatively  simple  when  all  the  terms  of  the  series  considered 
are  positive. t  We  shall  consider  such  series  first,  not  only  because  they  are 
the  easiest  to  deal  with,  but  also  because  the  discussion  of  the  convergence 

*It  is  of  course  a  matter  of  indifference  whether  we  denote  our  series  by  u\  +  112  +  ■  ■  ■ 
(as  in  Ch.  IV)  or  by  uq  +  u±  +  . . .  (as  here).  Later  in  this  chapter  we  shall  be  concerned 
with  series  of  the  type  a 0  +  a\X  +  a 2X2  +  ... :  for  these  the  latter  notation  is  clearly 
more  convenient.  We  shall  therefore  adopt  this  as  our  standard  notation.  But  we  shall 
not  adhere  to  it  systematically,  and  we  shall  suppose  that  u±  is  the  first  term  whenever 
this  course  is  more  convenient.  It  is  more  convenient,  for  example,  when  dealing  with 
the  series  1  +  3  +  5  +  •  •  • ,  to  suppose  that  un  =  1/n  and  that  the  series  begins  with  u\ , 
than  to  suppose  that  un  =  l/(n  +  1)  and  that  the  series  begins  with  uq.  This  remark 
applies,  e.g.,  to  Ex.  LXVIII.  4. 

I  Here  and  in  what  follows  ‘positive’  is  to  be  regarded  as  including  zero. 
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of  a  series  containing  negative  or  complex  terms  can  often  be  made  to 
depend  upon  a  similar  discussion  of  a  series  of  positive  terms  only. 

When  we  are  discussing  the  convergence  or  divergence  of  a  series  we 
may  disregard  any  finite  number  of  terms.  Thus,  when  a  series  contains  a 
finite  number  only  of  negative  or  complex  terms,  we  may  omit  them  and 
apply  the  theorems  which  follow  to  the  remainder. 

167.  It  will  be  well  to  recall  the  following  fundamental  theorems  es¬ 
tablished  in  §  77. 

A.  A  series  of  positive  terms  must  be  convergent  or  diverge  to  oo,  and 
cannot  oscillate. 

B.  The  necessary  and  sufficient  condition  that  Yun  should  be  conver¬ 
gent  is  that  there  should  be  a  number  K  such  that 


u0  +  Mi  H - \-un  <  K 


for  all  values  of  n. 

C.  The  comparison  theorem.  If  Y  un  is  convergent,  and  vn  ^  un 
for  all  values  of  n,  then  Yvn  is  convergent,  and  Y  vn  =  Yun-  More 
generally,  if  vn  A  Kun,  where  K  is  a  constant,  then  Yvn  is  convergent 
and  Yvn  =  KYun ■  Mid  if  Yun  is  divergent,  and  vn  A  Kun,  then 
Yvn  is  divergent.* 

Moreover,  in  inferring  the  convergence  or  divergence  of  Y  vn  by  means 
of  one  of  these  tests,  it  is  sufficient  to  know  that  the  test  is  satisfied  for 
sufficiently  large  values  of  n,  i.e.  for  all  values  of  n  greater  than  a  definite 
value  n0.  But  of  course  the  conclusion  that  Yvn  =  KYun  does  not 
necessarily  hold  in  this  case. 

A  particularly  useful  case  of  this  theorem  is 

D.  IfYun  is  convergent  ( divergent )  and  un/vn  tends  to  a  limit  other 
than  zero  as  n  — >■  oo,  then  Yvn  is  convergent  (divergent) . 


*The  last  part  of  this  theorem  was  not  actually  stated  in  §  77,  but  the  reader  will 
have  no  difficulty  in  supplying  the  proof. 
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168.  First  applications  of  these  tests.  The  one  important  fact 
which  we  know  at  present,  as  regards  the  convergence  of  any  special  class 
of  series,  is  that  'Yfjrn  is  convergent  if  r  <  1  and  divergent  if  r  ^  1.*  It  is 
therefore  natural  to  try  to  apply  Theorem  C,  taking  un  =  rn.  We  at  once 
find 

1.  The  series  is  convergent  if  vn  V  Krn,  where  r  <  1,  for  all 

sufficiently  large  values  of  n. 

When  K  =  1,  this  condition  may  be  written  in  the  form  v\[n  V  r. 
Hence  we  obtain  what  is  known  as  Cauchy’s  test  for  the  convergence  of 
a  series  of  positive  terms;  viz. 

2.  The  series  ^2vn  is  convergent  if  eg! n  ^  r,  where  r  <  1,  for  all 
sufficiently  large  values  of  n. 

There  is  a  corresponding  test  for  divergence,  viz. 

2  a.  The  series  vn  is  divergent  if  eg! n  ^  1  for  an  infinity  of  values 
of  n. 

This  hardly  requires  proof,  for  eg/n  ^  1  involves  vn  ^  1.  The  two 
theorems  2  and  2  a  are  of  very  wide  application,  but  for  some  purposes  it 
is  more  convenient  to  use  a  different  test  of  convergence,  viz. 

3.  The  series  is  convergent  if  vn+ i/vn  ^  r,  r  <  1,  for  all  suffi¬ 

ciently  large  values  of  n. 

To  prove  this  we  observe  that  if  vn+\/vn  ^  r  when  n  ^  n0  then 


vn 


v 


n 


Vn-l 


Vn— 1  Vn—2 


Vnp+1 

vno 


Vn0 


< 


Vn0.rn. 
rno  ’ 


and  the  result  follows  by  comparison  with  the  convergent  series  ^  rn.  This 
test  is  known  as  d’Alembert’s  test.  We  shall  see  later  that  it  is  less  gen¬ 
eral,  theoretically,  than  Cauchy’s,  in  that  Cauchy’s  test  can  be  applied 
whenever  d’Alembert’s  can,  and  sometimes  when  the  latter  cannot.  More¬ 
over  the  test  for  divergence  which  corresponds  to  d’Alembert’s  test  for 
convergence  is  much  less  general  than  the  test  given  by  Theorem  2a.  It  is 
true,  as  the  reader  will  easily  prove  for  himself,  that  if  vn+i/vn  ^  r  ^  1 
for  all  values  of  n,  or  all  sufficiently  large  values,  then  ]C  vn  is  divergent. 

*We  shall  use  r  in  this  chapter  to  denote  a  number  which  is  always  positive  or  zero. 
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But  it  is  not  true  (see  Ex.  LXVII.  9)  that  this  is  so  if  only  vn+i/vn  A  r  ^  1 
for  an  infinity  of  values  of  n,  whereas  in  Theorem  2  a  onr  test  had  only 
to  be  satisfied  for  such  an  infinity  of  values.  None  the  less  d’Alembert’s 
test  is  very  useful  in  practice,  because  when  vn  is  a  complicated  function 
vn+i/vn  is  often  much  less  complicated  and  so  easier  to  work  with. 

In  the  simplest  cases  which  occur  in  analysis  it  often  happens  that 
vn+i/vn  or  v]jn  tends  to  a  limit  as  n  — >  oo.*  When  this  limit  is  less  than  1, 
it  is  evident  that  the  conditions  of  Theorems  2  or  3  above  are  satisfied. 
Thus 

4.  If  vl/n  or  vn+i/vn  tends  to  a  limit  less  than  unity  as  n  — >■  oo,  then 
the  series  ^  vn  is  convergent. 

It  is  almost  obvious  that  if  either  function  tend  to  a  limit  greater  than 
unity,  then  is  divergent.  We  leave  the  formal  proof  of  this  as  an 

exercise  to  the  reader.  But  when  vf!n  or  vn+i/vn  tends  to  1  these  tests 
generally  fail  completely,  and  they  fail  also  when  v\[n  or  vn+i/vn  oscillates 
in  such  a  way  that,  while  always  less  than  1,  it  assumes  for  an  infinity  of 
values  of  n  values  approaching  indefinitely  near  to  1.  And  the  tests  which 
involve  vn+i/vn  fail  even  when  that  ratio  oscillates  so  as  to  be  sometimes 
less  than  and  sometimes  greater  than  1.  When  vl/n  behaves  in  this  way 
Theorem  2  a  is  sufficient  to  prove  the  divergence  of  the  series.  But  it  is 
clear  that  there  is  a  wide  margin  of  cases  in  which  some  more  subtle  tests 
will  be  needed. 

Examples  LXVII.  1.  Apply  Cauchy’s  and  d’Alembert’s  tests  (as  spe¬ 
cialised  in  4  above)  to  the  series  J2nkrn,  where  k  is  a  positive  rational  number. 

[Here  vn+i/vn  =  {(n  +  l)/n}fcr  — >  r ,  so  that  d’Alembert’s  test  shows  at 
once  that  the  series  is  convergent  if  r  <  1  and  divergent  if  r  >  1.  The  test 
fails  if  r  =  1:  but  the  series  is  then  obviously  divergent.  Since  limn1/”  =  1 
(Ex.  XXVII.  11),  Cauchy’s  test  leads  at  once  to  the  same  conclusions.] 

2.  Consider  the  series  ]T)(Anfc  +  Bnk~l  +  •  •  •  +  K)rn .  [We  may  suppose  A 
positive.  If  the  coefficient  of  rn  is  denoted  by  P(n),  then  P(n)/nk  — >  A  and,  by 

*It  will  be  proved  in  Ch.  IX  (Ex.  lxxxvii.  36)  that  if  vn+i/vn  — >  l  then  v\f  "  — >  l. 
That  the  converse  is  not  true  may  be  seen  by  supposing  that  vn  =  1  when  n  is  odd  and 
vn  =  2  when  n  is  even. 
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D  of  §  167,  the  series  behaves  like  Y2  nkrn.\ 

3.  Consider 


E 


Ank  +  Bnk  1  +  •  •  •  +  K 
- r 

an1  +  fin1-1  +  ■■■  +  « 


(A  >  0,  a  >  0). 


[The  series  behaves  like  lrn.  The  case  in  which  r  =  1,  k  <  l  requires 

further  consideration.] 

4.  We  have  seen  (Ch.  IV,  Misc.  Ex.  17)  that  the  series 

_ I _ ,  y _ I _ 

n(n  +  1)  ’  '  n(n  +  1) . . .  (n  +  p) 

are  convergent.  Show  that  Cauchy’s  and  d’Alembert’s  tests  both  fail  when 
applied  to  them.  [For  liiri  ul/n  =  lim(un+i  /un)  =  1.] 

5.  Show  that  the  series  ^  n_p,  where  p  is  an  integer  not  less  than  2,  is 
convergent.  [Since  lim{n(n  +  1) ...  (n  +  p  —  l)}/np  =  1,  this  follows  from  the 
convergence  of  the  series  considered  in  Ex.  4.  It  has  already  been  shown  in 
§  77,  (7)  that  the  series  is  divergent  if  p  =  1,  and  it  is  obviously  divergent  if 
p  E  0.] 

6.  Show  that  the  series 


E 


Ank  +  Bnk  1  +  •  •  •  +  K 
an1  +  fin}-1  +  •••  +  /« 


is  convergent  if  l  >  k  +  1  and  divergent  if  l  %  k  +  1. 

7.  If  mn  is  a  positive  integer,  and  mn+i  >  mn,  then  the  series  ^  rmn  is 
convergent  if  r  <  1  and  divergent  if  r  ^  1.  For  example  the  series  1  +  r  +  r4  + 
r9  +  . . .  is  convergent  if  r  <  1  and  divergent  if  r  ^  1. 

8.  Sum  the  series  1  +  2 r  +  2 r4  +  . . .  to  24  places  of  decimals  when  r  =  .1 
and  to  2  places  when  r  =  .9.  [If  r  =  .1,  then  the  first  5  terms  give  the  sum 
1.200  200  002  000  000  2,  and  the  error  is 


2r25  +  2r36  +  •  •  ■  <  2r25  +  2 r36  +  2 r47  +  •  •  ■  =  2r25/(l  -  r11)  <  3/1025. 


If  r  =  .9,  then  the  first  8  terms  give  the  sum  5.458  . . . ,  and  the  error  is  less  than 
2r64/ (1  -  r17)  <  .003.] 
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9.  If  0  <  a  <  b  <  1,  then  the  series  a  +  b  +  a2  +  b2  +  a3  +  . . .  is  convergent. 
Show  that  Cauchy’s  test  may  be  applied  to  this  series,  but  that  d’Alembert’s 
test  fails.  [For 


V2n+l/v2n  =  ( b/d ) 


n+ 1 


oo,  V2n+2/v2n+ 1  =  b(a/b) 


n+2 


0. 


2  S  2  S 

f  j"-*  f 

10.  The  series  1  +  r  H — H — -  +  . . .  and  1  +  r  H — ^  — =■  +  ...  are  convergent 

2!  3!  2Z  3d 

for  all  positive  values  of  r. 

11.  If  Y  un  is  convergent  then  so  are  Y  un  and  Y  un/ (1  +  un). 

12.  If  Y  un  is  convergent  then  so  is  Y  un/n.  [For  2 un/n  ^  u2  +  (1/n2)  and 
^(1/n2)  is  convergent.] 


13.  Show  that 


and 


1  1  _  3  /  1  1 

1+32  +  52+"'“4l1+22  +  32+'" 


1  1  1  1  1  1  _  i5  /  1  1 

1+22  +  32+P  +  62  +  72  +  92+"'_l6l1+22  +  32  + 


[To  prove  the  first  result  we  note  that 

1  +  P  +  ^2+'--=  (1  +  ^)  +  (^2+^)  +  •■■ 

1  i  1/11 

-1  +  P  +  P  +  -"  +  ^2(v1  +  ^2+^2+-' 

by  theorems  (8)  and  (6)  of  §  77.] 

14.  Prove  by  a  reductio  ad  absurdum  that  Y^-/n)  is  divergent.  [If  the  series 
were  convergent  we  should  have,  by  the  argument  used  in  Ex.  13, 


1  +  3  +  gH - —  (l+3  +  5  +  --  -)  +  3(1  +  5  +  3  +  ‘-  -); 


or 

1  +  1  +  1  +  .. .  =  1  +  1  +  1  + 

which  is  obviously  absurd,  since  every  term  of  the  first  series  is  less  than  the 
corresponding  term  of  the  second.] 
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169.  Before  proceeding  further  in  the  investigation  of  tests  of  con¬ 
vergence  and  divergence,  we  shall  prove  an  important  general  theorem 
concerning  series  of  positive  terms. 


Dirichlet’s  Theorem.*  The  sum  of  a  series  of  positive  terms  is  the 
same  in  whatever  order  the  terms  are  taken. 

This  theorem  asserts  that  if  we  have  a  convergent  series  of  positive 
terms,  uq  +  U\  +  u2  +  . . .  say,  and  form  any  other  series 


V0  +V1+V2  +  ... 


out  of  the  same  terms,  by  taking  them  in  any  new  order,  then  the  second 
series  is  convergent  and  has  the  same  sum  as  the  first.  Of  course  no  terms 
must  be  omitted:  every  u  must  come  somewhere  among  the  't/s,  and  vice 
versa. 

The  proof  is  extremely  simple.  Let  s  be  the  sum  of  the  series  of  u's. 
Then  the  sum  of  any  number  of  terms,  selected  from  the  u's ,  is  not  greater 
than  s.  But  every  v  is  a  u,  and  therefore  the  sum  of  any  number  of  terms 
selected  from  the  v's  is  not  greater  than  s.  Hence  vn  is  convergent,  and 
its  sum  t  is  not  greater  than  s.  But  we  can  show  in  exactly  the  same  way 
that  s  <  t.  Thus  s  =  t. 


170.  Multiplication  of  Series  of  Positive  Terms.  An  immediate 
corollary  from  Dirichlet’s  Theorem  is  the  following  theorem:  if  Uq  +  U\  + 
u2  +  •  •  •  and  v0  +  Vi  +  v2  +  . . .  are  two  convergent  series  of  positive  terms, 
and  s  and  t  are  their  respective  sums,  then  the  series 


U0V0  +  {uiV0  +  U0Vi)  +  (■ U2V0  +  U1V1  +  U0V2)  +  .  .  . 


is  convergent  and  has  the  sum  st. 


This  theorem  seems  to  have  first  been  stated  explicitly  by  Dirichlet  in  1837.  It  was 
no  doubt  known  to  earlier  writers,  and  in  particular  to  Cauchy. 
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Arrange  all  the  possible  products  of  pairs  umvn  in  the  form  of  a  doubly 
infinite  array 


UqVq 

U\Vq 

U2Vq 

UqVq 

UqV  i 

Mi  Mi 

U2V  i 

UqV  i 

UqV2 

MiM2 

U2V2 

UqV2 

UqVq 

MiM3 

U2V3 

UqVq 

We  can  rearrange  these  terms  in  the  form  of  a  simply  infinite  series  in  a 
variety  of  ways.  Among  these  are  the  following. 

(1)  We  begin  with  the  single  term  UqVq  for  which  m  +  n  =  0;  then  we 
take  the  two  terms  U\Vq,  UqV\  for  which  m  +  n  =  1;  then  the  three  terms 
u2v o,  UiVi,  uqV 2  for  which  m  +  n  =  2;  and  so  on.  We  thus  obtain  the  series 

U0v0  +  (u  1^0  +  UqV i)  +  (u2Vq  +  UiVi  +  U0V2)  +  .  .  . 
of  the  theorem. 

(2)  We  begin  with  the  single  term  u0yo  f°r  which  both  suffixes  are 
zero;  then  we  take  the  terms  UiV0,  U\Vi,  u0v i  which  involve  a  suffix  1  but 
no  higher  suffix;  then  the  terms  u2Vo,  u2Vi,  u2v2,  U\V2)  UqV2  which  involve  a 
suffix  2  but  no  higher  suffix;  and  so  on.  The  sums  of  these  groups  of  terms 
are  respectively  equal  to 

UqVq ,  (u0  +  Ml)  (u0  +  Ml)  -  U0V0, 

(U0  +  Mi  +  U2)(V0  +  Mi  +  V2)  ~  (' Uq  +  Mi)  {vq  +  Mi),  .  .  . 

and  the  sum  of  the  first  n  +  1  groups  is 

(mo  +  Ml  +  •  •  •  +  Mn)(u  0  +  Ml  +  •  •  •  +  Mn) , 

and  tends  to  st  as  n  — >  oo.  When  the  sum  of  the  series  is  formed  in  this 
manner  the  sum  of  the  first  one,  two,  three,  . . .  groups  comprises  all  the 
terms  in  the  first,  second,  third,  . . .  rectangles  indicated  in  the  diagram 
above. 

The  sum  of  the  series  formed  in  the  second  manner  is  st.  But  the  first 
series  is  (when  the  brackets  are  removed)  a  rearrangement  of  the  second; 
and  therefore,  by  Dirichlet’s  Theorem,  it  converges  to  the  sum  st.  Thus 
the  theorem  is  proved. 
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Examples  LXVIII.  1.  Verify  that  if  r  <  1  then 
1  +  r2  +  r  +  r4  +  r6  +  r3  +  •  •  ■  =  1  +  r  +  r3  +  r2  +  r5  +  r7  +  •  •  ■  =  1/(1  —  r). 

2*  If  either  of  the  series  uq  +  u\  +  . . . ,  v$  +  v\  +  . . .  is  divergent,  then  so  is 
the  series  uqvq  +  (niuo  +  uovi )  +  (u2Vq  +  u\v\  +  U0V2)  +  •  •  • ,  except  in  the  trivial 
case  in  which  every  term  of  one  series  is  zero. 

3.  If  the  series  uq  +  u\  +  . . . ,  vq  +  v\  +  . . . ,  wo  +  w±  +  . . .  converge  to 
sums  r,  s,  t,  then  the  series  ^ k ,  where  A k  =  Y1  umVn.Wp ,  the  summation  being 
extended  to  all  sets  of  values  of  m,  n,  p  such  that  m  +  n  +  p  =  k,  converges  to 
the  sum  rst. 

4.  If  J2  un  and  ^  vn  converge  to  sums  s  and  t,  then  the  series  ^  wn,  where 
wn  =  EuiVm,  the  summation  extending  to  all  pairs  l,  m  for  which  Im  =  n, 
converges  to  the  sum  st. 

171.  Further  tests  for  convergence  and  divergence.  The  exam¬ 
ples  on  pp.  385-387  suffice  to  show  that  there  are  simple  and  interesting 
types  of  series  of  positive  terms  which  cannot  be  dealt  with  by  the  general 
tests  of  §  168.  In  fact,  if  we  consider  the  simplest  type  of  series,  in  which 
un+i/un  tends  to  a  limit  as  n  — *  00,  the  tests  of  §  168  generally  fail  when 
this  limit  is  1.  Thus  in  Ex.  lxvii.  5  these  tests  failed,  and  we  had  to  fall 
back  upon  a  special  device,  which  was  in  essence  that  of  using  the  series 
of  Ex.  LXVII.  4  as  our  comparison  series,  instead  of  the  geometric  series. 

The  fact  is  that  the  geometric  series,  by  comparison  with  which  the  tests  of 
§  168  were  obtained,  is  not  only  convergent  but  very  rapidly  convergent,  far  more 
rapidly  than  is  necessary  in  order  to  ensure  convergence.  The  tests  derived  from 
comparison  with  it  are  therefore  naturally  very  crude,  and  much  more  delicate 
tests  are  often  wanted. 

We  proved  in  Ex.  xxvil.  7  that  nkrn  — >  0  as  n  — >  00,  provided  r  <  1, 
whatever  value  k  may  have;  and  in  Ex.  LXVII.  1  we  proved  more  than  this,  viz. 
that  the  series  ^nfcrn  is  convergent.  It  follows  that  the  sequence  r,  r2,  r3,  . . . , 
rn,  . . . ,  where  r  <  1,  diminishes  more  rapidly  than  the  sequence  l~fc, 

3~fc,  . . . ,  n~k,  ....  This  seems  at  first  paradoxical  if  r  is  not  much  less  than 
unity,  and  k  is  large.  Thus  of  the  two  sequences 

2  4  _8_  .  -|  _j_  1 

3’  9’  27’  V  4096’  531,441’  "• 
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whose  general  terms  are  (|)n  and  n~12,  the  second  seems  at  first  sight  to  decrease 
far  more  rapidly.  But  this  is  far  from  being  the  case;  if  only  we  go  far  enough 
into  the  sequences  we  shall  find  the  terms  of  the  first  sequence  very  much  the 
smaller.  For  example, 

(2/3)4  =  16/81  <  1/5,  (2 / 3) 12  <  ( 1/5)3  <  (1/10)2,  (2/3)1000  <  (1/10)166, 

while 

1000-12  =  10-36; 

so  that  the  1000th  term  of  the  first  sequence  is  less  than  the  10130th  part  of  the 
corresponding  term  of  the  second  sequence.  Thus  the  series  ^/)(2/3)n  is  far  more 
rapidly  convergent  than  the  series  Yn ~12,  and  even  this  series  is  very  much 
more  rapidly  convergent  than  Yn~2* 

172.  We  shall  proceed  to  establish  two  further  tests  for  the  con¬ 
vergence  or  divergence  of  series  of  positive  terms,  Maclaurin’s  (or 
Cauchy’s)  Integral  Test  and  Cauchy’s  Condensation  Test,  which, 
though  very  far  from  being  completely  general,  are  sufficiently  general  for 
our  needs  in  this  chapter. 

In  applying  either  of  these  tests  we  make  a  further  assumption  as  to 
the  nature  of  the  function  un ,  about  which  we  have  so  far  assumed  only 
that  it  is  positive.  We  assume  that  un  decreases  steadily  with  n:  i.e.  that 
un+ 1  V  un  for  all  values  of  n,  or  at  any  rate  all  sufficiently  large  values. 

This  condition  is  satisfied  in  all  the  most  important  cases.  From  one  point 
of  view  it  may  be  regarded  as  no  restriction  at  all,  so  long  as  we  are  dealing 
with  series  of  positive  terms:  for  in  virtue  of  Dirichlet’s  theorem  above  we  may 
rearrange  the  terms  without  affecting  the  question  of  convergence  or  divergence; 
and  there  is  nothing  to  prevent  us  rearranging  the  terms  in  descending  order  of 
magnitude ,  and  applying  our  tests  to  the  series  of  decreasing  terms  thus  obtained. 


*Five  terms  suffice  to  give  the  sum  of  Yn~12  correctly  to  7  places  of  decimals, 
whereas  some  10,000,000  are  needed  to  give  an  equally  good  approximation  to  Yfn~2- 
A  large  number  of  numerical  results  of  this  character  will  be  found  in  Appendix  III 
(compiled  by  Mr  J.  Jackson)  to  the  author’s  tract  ‘Orders  of  Infinity’  ( Cambridge  Math. 
Tracts,  No.  12). 
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But  before  we  proceed  to  the  statement  of  these  two  tests,  we  shall  state 
and  prove  a  simple  and  important  theorem,  which  we  shall  call  Abel’s 
Theorem.*  This  is  a  one-sided  theorem  in  that  it  gives  a  sufficient  test 
for  divergence  only  and  not  for  convergence,  but  it  is  essentially  of  a  more 
elementary  character  than  the  two  theorems  mentioned  above. 

173.  Abel’s  (or  Pringsheim’s)  Theorem.  If  un  is  a  convergent 
series  of  positive  and  decreasing  terms,  then  lim  nun  =  0. 

Suppose  that  nun  does  not  tend  to  zero.  Then  it  is  possible  to  find  a  positive 
number  6  such  that  nun  ^  5  for  an  infinity  of  values  of  n.  Let  n\  be  the  first 
such  value  of  n;  n2  the  next  such  value  of  n  which  is  more  than  twice  as  large 
as  n\\  n.3  the  next  such  value  of  n  which  is  more  than  twice  as  large  as  772;  and 
so  on.  Then  we  have  a  sequence  of  numbers  m,  ri2 ,  713,  ...  such  that  n2  >  2 m, 
n.3  >  2n2,  . . .  and  so  n2  —  n\  >  ^n2,  713  —  n2  >  1(713,  . . . ;  and  also  n\uni  ^  6, 
7i27in2  ^  5,  . . . .  But,  since  un  decreases  as  n  increases,  we  have 

UQ  +  UI-] - b  Ttni_l  ^  77l7tni  ^  5, 

Un  1  H - bTtna_i  ^  (n2  -  ni)un2  >  \n2Un2  ^  |(J, 

Un2-\ - 1-  Un3-1  ^  (773  -  n2)un3  >  \n^UH3  ^  \S, 

and  so  on.  Thus  we  can  bracket  the  terms  of  the  series  ^  un  so  as  to  obtain  a 
new  series  whose  terms  are  severally  greater  than  those  of  the  divergent  series 

S  +  7^5  +  ^(5  +  . . .  5 

and  therefore  Yl  un  is  divergent. 

Examples  LXIX.  1.  Use  Abel’s  theorem  to  show  that  ^(1/n)  and 
^{l/((m  +  b )}  are  divergent.  [Here  nun  — >  1  or  nun  — >  1/a.] 

2.  Show  that  Abel’s  theorem  is  not  true  if  we  omit  the  condition  that 
un  decreases  as  n  increases.  [The  series 

,  1  ,  1  ,  1  ,  1  ,  1  ,  1  ,  1  ,  1  ,  1 
1  +  ^2+^2+4  +  ^2+^2+72+^2  +  9  +  Y02+---’ 

*This  theorem  was  discovered  by  Abel  but  forgotten,  and  rediscovered  by  Pring- 
sheim. 
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in  which  un  =  1/n  or  1  /n2,  according  as  n  is  or  is  not  a  perfect  square,  is 
convergent,  since  it  may  be  rearranged  in  the  form 

1111111  /II 

22  +  32  +  52  +  62  +  72  +  P  +  T02+"'+V1  +  4  +  9  + 

and  each  of  these  series  is  convergent.  But,  since  nun  =  1  whenever  n  is  a  perfect 
square,  it  is  clearly  not  true  that  nun  — »•  0.] 

3.  The  converse  of  Abel’s  theorem  is  not  true,  i.e.  it  is  not  true  that,  if 
un  decreases  with  n  and  lim  nun  =  0,  then  Ylun  is  convergent. 

[Take  the  series  ^](l/n)  and  multiply  the  first  term  by  1,  the  second  by 
the  next  two  by  the  next  four  by  the  next  eight  by  g,  and  so  on.  On 
grouping  in  brackets  the  terms  of  the  new  series  thus  formed  we  obtain 


1  +  1.  1  +  1  (1  +  1)  +  1  (’1  +  1  +  1  +  n  +  • 

1  *  2  2^3V3^4/^4V5^6^7^8/^’‘-’ 


and  this  series  is  divergent,  since  its  terms  are  greater  than  those  of 


1  +  1. 1  +  1. 1  +  1.1  + 

'9  9  ~  3  9  i  4  9  ~  *  *  *  ? 


which  is  divergent.  But  it  is  easy  to  see  that  the  terms  of  the  series 


1  +  1. 1  +  1. 1  +  1. 1  +  1.1  + 

x  i  2  2  3  3  1  3  4  t  4  5’ 


satisfy  the  condition  that  nun  — >  0.  In  fact  nun  = 
v  -+  oo  as  n  — >  oo.] 


1  1 


+  ... 


l/v  if  2U~2 


<nS  ^ 


and 


174.  Maclaurin’s  (or  Cauchy’s)  Integral  Test.*  If  un  decreases 
steadily  as  n  increases,  we  can  write  un  =  <j)(n)  and  suppose  that  (f>(n)  is 
the  value  assumed,  when  x  =  n,  by  a  continuous  and  steadily  decreasing 
function  <f>{x)  of  the  continuous  variable  x.  Then,  If  v  is  any  positive 
integer,  we  have 

<j)(v  —  1)  ^  <f)(x)  ^  <f>{y) 

when  v  —  1  <  x  <  v.  Let 


x)  dx  =  /  {<f>(v —  1  )  —  cj)(x)}dx, 

Ju-l 

*The  test  was  discovered  by  Maclaurin  and  rediscovered  by  Cauchy,  to  whom  it  is 
usually  attributed. 


Vu  =  4>(v  -  1)  - 


<v- 1 
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so  that 


0  ^  vv  S  - 1)  -  0(vh 
Then  22  Vv  is  a  series  of  positive  terms,  and 

vi  +  v3  4 - h  vn  2  0(1)  -  Hn)  =  0(1)- 

Hence  22  vv  is  convergent,  and  so  v2  +  v3  +  ■  ■  ■  +  vn  or 


n—  1 


0(x) dx 

1  J 1 


1 


tends  to  a  positive  limit  as  n  — »  oo. 
Let  us  write 


$(0  =  I  <t>(x)  dx, 


so  that  <!>(£)  is  a  continuous  and  steadily  increasing  function  of  £.  Then 


Ml  +  u2  H - h  un-l  -  $(n) 


tends  to  a  positive  limit,  not  greater  than  0(1),  as  n  — *  oo.  Hence  is 
convergent  or  divergent  according  as  $(n)  tends  to  a  limit  or  to  inhn- 
ity  as  n  — »  oo,  and  therefore,  since  <E>(n)  increases  steadily,  according  as 
4>(£)  tends  to  a  limit  or  to  infinity  as  £  — >•  oo.  Hence  if  f>(x)  is  a  function 
of  x  which  is  positive  and  continuous  for  all  values  of  x  greater  than  unity, 
and  decreases  steadily  as  x  increases,  then  the  series 


0(1)  +  0(2)  +  •  •  • 

does  or  does  not  converge  according  as 


does  or  does  not  tend  to  a  limit  l  as  £  — >  oo;  and,  in  the  first  case,  the  sum 
of  the  series  is  not  greater  than  0(1)  +  l. 

The  sum  must  in  fact  be  less  than  0(1)  + 1.  For  it  follows  from  (6)  of  §  160, 
and  Ch.  VII,  Misc.  Ex.  41,  that  vv  <  —  1)  —  0(i/),  unless  0(x)  =  f{y) 

throughout  the  interval  [v  —  1,  v\\  and  this  cannot  be  true  for  all  values  of  v. 
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Examples  LXX.  1.  Prove  that 


E 

l 


1 

n2  +  1 


< 


+  i7r- 


2.  Prove  that 


OO 

i  ^  a 

**  <  V  a2  +  n2 


<  \k. 


3.  Prove  that  if  m  >  0  then 


(Math.  Trip.  1909.) 


1  1  1  m+ 1 

m2  (m  +  l)2  (m  +  2)2  m 


175.  The  series  s.  By  far  the  most  important  application  of 
the  Integral  Test  is  to  the  series 


l"s  +  2"s  +  +  •  •  •  +  rTs  +  . . . , 


where  s  is  any  rational  number.  We  have  seen  already  (§  77  and 
Exs.  lxvii.  14,  lxix.  1)  that  the  series  is  divergent  when  s  —  1. 

If  s  E[  0  then  it  is  obvious  that  the  series  is  divergent.  If  s  >  0  then  un 
decreases  as  n  increases,  and  we  can  apply  the  test.  Here 


$(0  = 


e1-*  - 1 

1  —  s 


unless  s  =  1.  If  s  >  1  then  s  — >  0  as  f  — >  oo,  and 


1 


l, 


say.  And  if  s  <  1  then  £1_s  — *  oo  as  £  — »  oo,  and  so  <!>(£)  — >  cx).  Thus 
the  series  Y2n~s  convergent  if  s  >  1,  divergent  if  s  A  1,  and  in  the  first 
case  its  sum  is  less  than  s/(s  —  1). 
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So  far  as  divergence  for  s  <  1  is  concerned,  this  result  might  have  been 
derived  at  once  from  comparison  with  ^(1/tj),  which  we  already  know  to  be 
divergent. 

It  is  however  interesting  to  see  how  the  Integral  Test  may  be  applied  to  the 
series  ^(1/tj),  when  the  preceding  analysis  fails.  In  this  case 


m  = 


and  it  is  easy  to  see  that  $(£)  ->  oo  as  £  — >  oo.  For  if  £  >  2n  then 


S(0> 


dx  f 4  dx 

- h  / - 1 - + 

X  J  2  X 


dx 

x 


But  by  putting  x  =  2 ru  we  obtain 


and  so  <£(£)  >  n 


du 

5 

u 


which  shows  that  <£(£)  — >  oo  as  £  — >  oo. 


Examples  LXXI.  1.  Prove  by  an  argument  similar  to  that  used  above, 

r  £  d  jQ 

and  without  integration,  that  <F(£)  =  /  — ,  where  s  <  1,  tends  to  infinity 

J 1  xs 

with  £. 

2.  The  series  ^n~2,  ^]n-3/2,  J[]n_11//10  are  convergent,  and  their  sums 
are  not  greater  than  2,  3,  11  respectively.  The  series  ^ n _1/2,  ^n~10/n  are 
divergent. 

3.  The  series  ’Yhn11  /(n*  +  a),  where  a  >  0,  is  convergent  or  divergent  ac¬ 
cording  as  t  >  1  +  s  or  t  ^  1  +  s.  [Compare  with  ^  ns~*  ] 

4.  Discuss  the  convergence  or  divergence  of  the  series 

Y,(ainSl  +  a2nS2  -| - h  aknSk) / (bin*1  +  b2n *2  H - b  bin tl), 


where  all  the  letters  denote  positive  numbers  and  the  s’s  and  t’s  are  rational 
and  arranged  in  descending  order  of  magnitude. 
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5.  Prove  that 


2  y/n  —  2  <  — =  4 — j=  +  •  •  •  4 — -j=  <  2 \Jn  —  1, 
VI  v  2  Vn 


^<i7r  +  i71  +  I75  +  "'<5(’r  +  1)' 

(Math.  Trip.  1911.) 

6.  If  0(n)  — >  l  >  1  then  the  series  ^  is  convergent.  If  0(n)  — >■  l  <  1 

then  it  is  divergent. 


176.  Cauchy’s  Condensation  Test.  The  second  of  the  two  tests 
mentioned  in  §  172  is  as  follows:  if  un  =  0(n)  is  a  decreasing  function  ofn, 
then  the  series  Y2  <f>(n)  is  convergent  or  divergent  according  as  ^  2n<f(2n)  is 
convergent  or  divergent. 

We  can  prove  this  by  an  argument  which  we  have  used  already  (§  77) 
in  the  special  case  of  the  series  ^(l/n).  In  the  first  place 

0(3) +  0(4)  ^20(4), 

0(5) +  0(6)  + •••  +  0(8)  ^40(8), 


0(2n  +  1)  +  0(2n  +  2)  +  •  •  •  +  0(2n+1)  ^  2n0(2n+1). 

If  2n0(2n)  diverges  then  so  do  2n+10(2n+1)  and  ^  2Tl0(2ri+1),  and 
then  the  inequalities  just  obtained  show  that  Jj)0(n)  diverges. 

On  the  other  hand 

0(2)  +  0(3)  ^  20(2),  0(4)  +  0(5)  +  •  •  •  +  0(7)  ^  40(4), 

and  so  on.  And  from  this  set  of  inequalities  it  follows  that  if  ^  2n0(2n) 
converges  then  so  does  ^0(n).  Thus  the  theorem  is  established. 

For  our  present  purposes  the  held  of  application  of  this  test  is  practically 
the  same  as  that  of  the  Integral  Test.  It  enables  us  to  discuss  the  series 
Y2  n~s  with  equal  ease.  For  Yf,  n~s  will  converge  or  diverge  according  as 
2n2~ns  converges  or  diverges,  i.e.  according  as  s  >  1  or  s  ^  1. 
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Examples  LXXII.  1.  Show  that  if  a  is  any  positive  integer  greater 
than  1  then  ^  0(n)  is  convergent  or  divergent  according  as  an0(an )  is  con¬ 
vergent  or  divergent.  [Use  the  same  arguments  as  above,  taking  groups  of  a,  a2, 
a3,  ...  terms.] 

2.  If  ]T]2Tl0(2Tl)  converges  then  it  is  obvious  that  lim2n0(2n)  =  0.  Hence 
deduce  Abel’s  Theorem  of  5  173. 


177.  Infinite  Integrals.  The  Integral  Test  of  §  174  shows  that,  if 
0(x)  is  a  positive  and  decreasing  function  of  x,  then  the  series  ^  0(n)  is 
convergent  or  divergent  according  as  the  integral  function  $(a;)  does  or 
does  not  tend  to  a  limit  as  x  — »  oo.  Let  us  suppose  that  it  does  tend  to  a 
limit,  and  that 

lim  /  0(f)  dt  =  l. 

£—>■00  J  ^ 

Then  we  shall  say  that  the  integral 

0(f)  dt 

is  convergent,  and  has  the  value  l ;  and  we  shall  call  the  integral  an  infi¬ 
nite  integral. 

So  far  we  have  supposed  0(f)  positive  and  decreasing.  But  it  is  natural 
to  extend  our  definition  to  other  cases.  Nor  is  there  any  special  point  in 
supposing  the  lower  limit  to  be  unity.  We  are  accordingly  led  to  formulate 
the  following  definition: 

//0(f)  is  a  function  oft  continuous  when  f  ^  a,  and 


lim  f  0(f)  dt  =  /, 

Ja 


then  we  shall  say  that  the  infinite  integral 


(1) 


is  convergent  and  has  the  value  l. 
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The  ordinary  integral  between  limits  a  and  A ,  as  defined  in  Ch.  VII, 
we  shall  sometimes  call  in  contrast  a  finite  integral. 

On  the  other  hand,  when 


4>(t)  dt  — >  oo, 

we  shall  say  that  the  integral  diverges  to  oo,  and  we  can  give  a  similar 
definition  of  divergence  to  — oo.  Finally,  when  none  of  these  alternatives 
occur,  we  shall  say  that  the  integral  oscillates ,  finitely  or  infinitely,  as 
x  — *  oo. 

These  definitions  suggest  the  following  remarks. 

(i)  If  we  write 

c t>(t )  dt  =  ^>(x), 

then  the  integral  converges,  diverges,  or  oscillates  according  as  <h(x)  tends  to  a 
limit,  tends  to  oo  (or  to  — oo),  or  oscillates,  as  x  — >  oo.  If  <h(x)  tends  to  a  limit, 
which  we  may  denote  by  <I?(oo),  then  the  value  of  the  integral  is  $(oo).  More 
generally,  if  <h(x)  is  any  integral  function  of  <f{x),  then  the  value  of  the  integral 
is  <h(oo)  —  <h(a). 

(ii)  In  the  special  case  in  which  c j>(t)  is  always  positive  it  is  clear  that 
<3?(x)  is  an  increasing  function  of  x.  Hence  the  only  alternatives  are  convergence 
and  divergence  to  oo. 

(iii)  The  integral  (1)  of  course  depends  on  a,  but  is  quite  independent  of  t, 
and  is  in  no  way  altered  by  the  substitution  of  any  other  letter  for  t  (cf.  §  157). 

(iv)  Of  course  the  reader  will  not  be  puzzled  by  the  use  of  the  term  infinite 
integral  to  denote  something  which  has  a  definite  value  such  as  2  or  \t\.  The 
distinction  between  an  infinite  integral  and  a  finite  integral  is  similar  to  that 
between  an  infinite  series  and  a  finite  series:  no  one  supposes  that  an  infinite 
series  is  necessarily  divergent. 

rx 

(v)  The  integral  /  f>(t)  dt  was  defined  in  §§  156  and  157  as  a  simple  limit, 

J  a 

i.e.  the  limit  of  a  certain  finite  sum.  The  infinite  integral  is  therefore  the  limit  of 
a  limit,  or  what  is  known  as  a  repeated  limit.  The  notion  of  the  infinite  integral 
is  in  fact  essentially  more  complex  than  that  of  the  finite  integral,  of  which  it  is 
a  development. 
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(vi)  The  Integral  Test  of  §  174  may  now  be  stated  in  the  form:  if  4>(x)  is 
positive  and  steadily  decreases  as  x  increases,  then  the  infinite  series  <Kn) 

/OO  '  ^ 

4>(x)  dx  converge  or  diverge  together. 

(vii)  The  reader  will  find  no  difficulty  in  formulating  and  proving  theorems 
for  infinite  integrals  analogous  to  those  stated  in  (l)-(6)  of  §  77.  Thus  the  result 


analogous  to  (2)  is  that  if 
is  convergent  and 


cj>(x)  dx  is  convergent,  and  b  >  a,  then  /  <j>{x)  dx 


roc  rb  roc 

/  f(x)  dx=  <t>(x)  dx+  <f(x)  dx. 
J  a  J  a  J  b 


178.  The  case  in  which  <p(x)  is  positive.  It  is  natural  to  consider 
what  are  the  general  theorems,  concerning  the  convergence  or  divergence  of 
the  infinite  integral  (1)  of  §  177,  analogous  to  theorems  A-D  of  §  167.  That 
A  is  true  of  integrals  as  well  as  of  series  we  have  already  seen  in  §  177,  (ii). 
Corresponding  to  B  we  have  the  theorem  that  the  necessary  and  sufficient 
condition  for  the  convergence  of  the  integral  (1)  is  that  it  should  be  possible 
to  find  a  constant  K  such  that 


0(i)  dt  <  K 


for  all  values  of  x  greater  than  a. 

r  oo 

Similarly,  corresponding  to  C,  we  have  the  theorem:  if  /  f(x)  dx  is 

J  a 

convergent,  and  fifx)  A  Kfifx)  for  all  values  of  x  greater  than  a,  then 

ro o 

/  fi(x)  dx  is  convergent  and 


ip(x)  dx  A  K  /  cj)(x)  dx. 


We  leave  it  to  the  reader  to  formulate  the  corresponding  test  for  divergence. 

We  may  observe  that  d’Alembert’s  test  (§  168),  depending  as  it  does  on 
the  notion  of  successive  terms,  has  no  analogue  for  integrals;  and  that  the 
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analogue  of  Cauchy’s  test  is  not  of  much  importance,  and  in  any  case  could 
only  be  formulated  when  we  have  investigated  in  greater  detail  the  theory 
of  the  function  <j)(x)  =  rx,  as  we  shall  do  in  Cli.  IX.  The  most  important 
special  tests  are  obtained  by  comparison  with  the  integral 


(a  >  0), 


whose  convergence  or  divergence  we  have  investigated  in  §  175,  and  are 

/“OO 

as  follows:  if  <p{x)  <  Kx~s,  where  s  >  1,  when  x  ^  a,  then  /  tf{x)dx 

J  a 

is  convergent;  and  if  <f{x)  >  Kx~s ,  where  s  ^  1,  when  x  ^  a,  then  the 
integral  is  divergent;  and  in  particular,  if  lim  xs  cp(x )  =  l,  where  l  >  0,  then 
the  integral  is  convergent  or  divergent  according  as  s  >  1  or  s  ^  1 . 


There  is  one  fundamental  property  of  a  convergent  infinite  series  in  regard 
to  which  the  analogy  between  infinite  series  and  infinite  integrals  breaks  down. 
If  <?Kn)  is  convergent  then  cf>(n)  — >  0;  but  it  is  not  always  true,  even  when 

/*oo 

4>(x)  is  always  positive,  that  if  /  4>(x)  dx  is  convergent  then  <f(x)  — >  0. 

J  a 

Consider  for  example  the  function  4>(x)  whose  graph  is  indicated  by  the 
thick  line  in  the  figure.  Here  the  height  of  the  peaks  corresponding  to  the  points 
x  =  1,  2,  3,  . . .  is  in  each  case  unity,  and  the  breadth  of  the  peak  corresponding 
to  x  =  n  is  2/(n  +  l)2.  The  area  of  the  peak  is  l/(n  +  l)2,  and  it  is  evident  that, 


Fig.  50. 
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for  any  value  of 


<j>(x)  dx  < 


OO 


E 


1 

(n  +  l)2  ’ 


so  that 


4>(x)  dx  is  convergent;  but  it  is  not  true  that  (j>(x)  — *  0. 


Examples  LXXIII.  1.  The  integral 

f°°  axr  +  fix’-1  +  •  •  •  +  A 

Ja  Hx*  +  Bx*-1  +  •  •  •  +  L  X’ 

where  a  and  A  are  positive  and  a  is  greater  than  the  greatest  root  of  the  de¬ 
nominator,  is  convergent  if  s  >  r  +  1  and  otherwise  divergent. 

2.  Which  of  the  integrals 

r°°  dx  r°°  xdx  r°°  x2dx  r°°  x2dx 
Ja  c2  +  x2  ’  Ja  c2  +  x2  1  Ja  c2  T  x2  ’  Ja  a  +  2/3x2  +  yx4 

are  convergent?  In  the  first  two  integrals  it  is  supposed  that  a  >  0,  and  in  the 

last  that  a  is  greater  than  the  greatest  root  (if  any)  of  the  denominator. 

3.  The  integrals 


cos  xdx,  /  sin  xdx,  /  cos  (ax  +  P)  dx 


oscillate  finitely  as  £  — >•  oo. 
4.  The  integrals 


/  xcos  xdx,  /  x2  sin  xdx  /  xncos  (ax  +  P)dx, 
J  a  J  a  J  a 


where  n  is  any  positive  integer,  oscillate  infinitely  as  £  — >  oo. 

r 

5.  Integrals  to  — oo.  If  /  p(x)  dx  tends  to  a  limit  l  as  £  — >•  — oo,  then 

h 

r 

we  say  that  /  </>(x)  dx  is  convergent  and  equal  to  l.  Such  integrals  possess 


—  OO 
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properties  in  every  respect  analogous  to  those  of  the  integrals  discussed  in  the 
preceding  sections:  the  reader  will  find  no  difficulty  in  formulating  them. 

6.  Integrals  from  — oo  to  +oo.  If  the  integrals 


4>(x)  dx, 


dx 


are  both  convergent,  and  have  the  values  k,  l  respectively,  then  we  say  that 

poo 

/  dx 


is  convergent  and  has  the  value  k  +  l. 
7.  Prove  that 


r°  dx  r°°  dx  _ !  r°°  dx  _1 
J -  oo  1  +  X2  Jo  1  +  x<2  2  J- oo  1  +  X2  2 


8.  Prove  generally  that 


poo 

2  /  0(x2) 

Jo 


dx, 


provided  that  the  integral  /  </>(x2)  dx  is  convergent. 

'  o 


9.  Prove 


POO  POO 

that  if  /  x(j)(x2)  dx  is  convergent  then  /  x(f>(x2)  dx  =  0. 

J  0  J — oo 

10.  Analogue  of  Abel’s  Theorem  of  §  173.  If(f>(x )  is  positive  and  steadily 

POO 

decreases,  and  /  4>(x)  dx  is  convergent,  then  x< j>(x)  — >  0.  Prove  this  (a)  by 

J  a 

means  of  Abel’s  Theorem  and  the  Integral  Test  and  ( b )  directly,  by  arguments 
analogous  to  those  of  §  173. 

11.  If  a  =  x o  <  x\  <  X2  <  ■  ■  ■  and  xn 


(*'^n-\- 1 


the  convergence  of 


oo,  and  un  =  /  4>(x)  dx,  then 

J  Xn 

4>{x)  dx  involves  that  of  Y2  un ■  If  is  always  positive 


J  a 

the  converse  statement  is  also  true.  [That  the  converse  is  not  true  in  general  is 
shown  by  the  example  in  which  <j>(x)  =  cosx,  xn  =  n7r.] 
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179.  Application  to  infinite  integrals  of  the  rules  for  substitu¬ 
tion  and  integration  by  parts.  The  rules  for  the  transformation  of  a 
definite  integral  which  were  discussed  in  §  161  may  be  extended  so  as  to 
apply  to  infinite  integrals. 

(1)  Transformation  by  substitution.  Suppose  that 


(1) 


is  convergent.  Further  suppose  that,  for  any  value  of  £  greater  than  a,  we 
have,  as  in  §  161, 


where  a  =  f(b),  £  =  /(r).  Finally  suppose  that  the  functional  relation 
x  =  f{t)  is  such  that  x  — *  oo  as  t  — >•  oo.  Then,  making  r  and  so  £  tend 
to  oo  in  (2),  we  see  that  the  integral 


(3) 


is  convergent  and  equal  to  the  integral  (1). 

On  the  other  hand  it  may  happen  that  £  — »  oo  as  r  — »  —  oo  or  as  r  — »  c. 
In  the  first  case  we  obtain 


In  the  second  case  we  obtain 


We  shall  return  to  this  equation  in  §  181. 
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There  are  of  course  corresponding  results  for  the  integrals 


4>(x)  dx, 


4>{x)  dx, 


which  it  is  not  worth  while  to  set  out  in  detail:  the  reader  will  be  able  to 
formulate  them  for  himself. 

Examples  LXXIV.  1.  Show,  by  means  of  the  substitution  x  =  ta , 
that  if  s  >  1  and  a  >  0  then 

/oo  COO 

x~s  dx  =  a  J  ^(i-s)-i  dt; 

and  verify  the  result  by  calculating  the  value  of  each  integral  directly. 

COO 

2.  If  /  4>(x)  dx  is  convergent  then  it  is  equal  to  one  or  other  of 

J  a 


'(a-p)/oL 


f(a-t 3)/a 

cf>(at  +  (3)  dt,  —a  4>(at  +  (5)  dt, 

J  —  OO 


according  as  a  is  positive  or  negative. 

3.  If  4>(x)  is  a  positive  and  steadily  decreasing  function  of  x,  and  a  and  f3 
are  any  positive  numbers,  then  the  convergence  of  the  series  4>{n)  implies  and 
is  implied  by  that  of  the  series  Yh  4>{an  +  &)■ 

[It  follows  at  once,  on  making  the  substitution  x  =  at  +  j3,  that  the  integrals 


/  (j>(x)dx,  /  (j)(at  +  P)dt 

J  a  J  (a—/3)/a 

converge  or  diverge  together.  Now  use  the  Integral  Test.] 

4.  Show  that 


(1  +  x)^/x 


=  h- 


[Put  X  =  t2.] 

5.  Show  that 


dx  =  ivr 

(1  +  x)2  2 
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[Put  x  =  t2  and  integrate  by  parts.] 

6.  If  4>{x)  — >  h  as  x  — >  oo,  and  4>(x)  — >  k  as  x  — >  — oo,  then 


{4>(x  —  a)  —  cj)(x  —  b )}  dx  =  —(a  —  b){h  —  k). 


[For 


/{4>(x  —  a)  —  4>(x  —  6)}  dx=  4>(x  —  a)  dx  —  /  cf>(x  —  b) 

./-C  J-i' 

r£-a  r£~b 

=  /  (j>(t)dt  -  /  <j){t) 

J-e-a  J-f'-b 


—  b)  dx 
dt 


•-?- 
r-e-b 


-e-b 

t-b 


r-Z-o  rK-o 

/  i j)(t)  dt  —  <f>(t)  dt. 

J—£'—a  J  f—a 


a  J  a 

The  first  of  these  two  integrals  may  be  expressed  in  the  form 

r-C-b 


r-K-o 

(a  —  b)k  +  pdt, 

J-i’-a 


where  p  — >  0  as  — >  oo,  and  the  modulus  of  the  last  integral  is  less  than  or 
equal  to  |a  —  b\n,  where  n  is  the  greatest  value  of  p  throughout  the  interval 
[— £'  —  a,  —  £'  —  b\.  Hence 

r-t'-b 

/  4>(t)  dt  -»  (a  —  b)k. 

J-Z-a 

The  second  integral  may  be  discussed  similarly.] 

(2)  Integration  by  parts.  The  formula  for  integration  by  parts 
(§  161)  is 


f{x)4>{x)  dx  =  /(O0(O  -  f  (a)(1)(a) 


f(x)4>(x)  dx. 


Suppose  now  that  ^  >  oo.  Then  if  any  two  of  the  three  terms  in  the 

above  equation  which  involve  £  tend  to  limits,  so  does  the  third,  and  we 
obtain  the  result 

POO  POO 

/  f(x)(j),(x)  dx  =  lim  /(£)</>(£)  -  / (a)(1)(a)  -  /  f'(x)<j>(x)  dx. 

Ja  Ja 

There  are  of  course  similar  results  for  integrals  to  —  oo,  or  from  —  oo  to  oo. 
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Examples  LXXV.  1.  Show  that 


x 

(1  +  x)3 


dx 


,  r°°  dx 

~2Jo  (TT^F 


2. 

3. 


f 


x 


(1  +  x)‘ 


■  dx 


=  1 

Jo 


X  A  1 
— - rrr  dx  =  ~  . 

(l  +  ic)3  3 


If  m  and  n  are  positive  integers,  and 

xmdx 

m’n  ~  L  (i+x)m+n' 


i 

2' 


then 

Im,n  —  {ui/ T  n.  1 

Hence  prove  that  Im,n  =  m!  (n  ~  2 )!/(m  +  n  —  1)!. 
4.  Show  similarly  that  if 

r  _  f°°  x2m+1dx 

m’n  ~  J0  (1  +  x2)m+n 

then 


Im,n  =  {m/ ( m  +  n  -  l)}Im-i,n ,  2/m,n  =  m!  (n  -  2)!/ (m  +  n  -  1)!. 

Verify  the  result  by  applying  the  substitution  x  =  t2  to  the  result  of  Ex.  3. 

180.  Other  types  of  infinite  integrals.  It  was  assumed,  in  the 
definition  of  the  ordinary  or  finite  integral  given  in  Ch.  VII,  that  (1)  the 
range  of  integration  is  finite  and  (2)  the  subject  of  integration  is  continuous. 

It  is  possible,  however,  to  extend  the  notion  of  the  ‘definite  integral’  so 
as  to  apply  to  many  cases  in  which  these  conditions  are  not  satisfied.  The 
‘infinite’  integrals  which  we  have  discussed  in  the  preceding  sections,  for 
example,  differ  from  those  of  Ch.  VII  in  that  the  range  of  integration  is 
infinite.  We  shall  now  suppose  that  it  is  the  second  of  the  conditions  (1),  (2) 
that  is  not  satisfied.  It  is  natural  to  try  to  frame  definitions  applicable  to 
some  such  cases  at  any  rate.  There  is  only  one  such  case  which  we  shall 
consider  here.  We  shall  suppose  that  (j)(x)  is  continuous  throughout  the 
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range  of  integration  [a,  A]  except  for  a  finite  number  of  values  of  x,  say 
x  —  £1,  £2,  ■  •  • ,  and  that  (p(x)  — )■  00  or  cj)(x)  — >  —00  as  x  tends  to  any  of 
these  exceptional  values  from  either  side. 

It  is  evident  that  we  need  only  consider  the  case  in  which  [a,  A]  contains 
one  such  point  £.  When  there  is  more  than  one  such  point  we  can  divide 
up  [a,  A]  into  a  finite  number  of  sub-intervals  each  of  which  contains  only 
one;  and,  if  the  value  of  the  integral  over  each  of  these  sub-intervals  has 
been  defined,  we  can  then  define  the  integral  over  the  whole  interval  as 
being  the  sum  of  the  integrals  over  each  sub-interval.  Further,  we  can 
suppose  that  the  one  point  £  in  [a,  A]  comes  at  one  or  other  of  the  limits 

fA 

a,  A.  For,  if  it  comes  between  a  and  A,  we  can  then  define  /  cp(x)  dx  as 


4>(x)  dx+  cj)(x)  dx , 


assuming  each  of  these  integrals  to  have  been  satisfactorily  defined.  We 
shall  suppose,  then,  that  £  =  a;  it  is  evident  that  the  definitions  to  which 
we  are  led  will  apply,  with  trifling  changes,  to  the  case  in  which  £  =  A. 

Let  us  then  suppose  4>(x)  to  be  continuous  throughout  [a,  A]  except  for 
x  =  a,  while  £>(x)  — >  00  as  x  — »  a  through  values  greater  than  a.  A  typical 
example  of  such  a  function  is  given  by 


cp(x)  —  (x  —  a)  s, 

where  s  >  0;  or,  in  particular,  if  a  =  0,  by  (p(x)  =  x~s.  Let  us  therefore 
consider  how  we  can  define 

(i) 

Jo  Xs 

when  s  >  0. 

/»oo 

The  integral  /  ys~2  dy  is  convergent  if  s  <  1  (§  175)  and  means 

Jl/A 


lim 

77— >-00 


'1 M 


ys  2  dy.  But  if  we  make  the  substitution  y  —  1/x,  we  obtain 


/  ys  dy=  x  s  dx. 

’  l/A  Jl/ri 
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Thus  lim 

7/— )-00 


1  h 


x  s  dx,  or,  what  is  the  same  thing, 


lim 

e— s>+0 


x  s  dx, 


exists  provided  that  s  <  1;  and  it  is  natural  to  define  the  value  of  the 
integral  (1)  as  being  equal  to  this  limit.  Similar  considerations  lead  us  to 


define 


(x  —  a)  s  dx  by  the  equation 


(x  —  a)  s  dx  =  lim  /  (x  —  a)  s  dx. 

6-5-+0  ' 


'  a+e 


We  are  thus  led  to  the  following  general  definition:  if  the  integral 

rA 

/  <f(x)  dx 


'  a+e 


tends  to  a  limit  l  as  e  — *  +0,  we  shall  say  that  the  integral 

rA 

/  <f(x)  dx 


is  convergent  and  has  the  value  l. 

Similarly,  when  <f>(x)  — >  00  as  x  tends  to  the  upper  limit  A,  we  define 

[A 

/  <f(x)  dx  as  being 

J  a 

pA—e 

lim  /  Six)  dx  : 
e^+0  J  a 

and  then,  as  we  explained  above,  we  can  extend  our  definitions  to  cover 
the  case  in  which  the  interval  [a,  A]  contains  any  finite  number  of  infinities 
of  (j,{x). 

An  integral  in  which  the  subject  of  integration  tends  to  00  or  to  —  00 
as  x  tends  to  some  value  or  values  included  in  the  range  of  integration  will 
be  called  an  infinite  integral  of  the  second  kind :  the  first  kind  of  infinite 
integrals  being  the  class  discussed  in  §§  177  et  seq.  Nearly  all  the  remarks 
(i)-(vii)  made  at  the  end  of  §  177  apply  to  infinite  integrals  of  the  second 
kind  as  well  as  to  those  of  the  first. 


[VIII :  181]  THE  CONVERGENCE  OF  INFINITE  SERIES,  ETC. 


410 


181.  We  may  now  write  the  equation  (4)  of  §  179  in  the  form 

/♦oo  rc 

/  4>(x)dx=  (j>{f(t)}f'(t)dt.  (1) 

J  a  J  b 

The  integral  on  the  right-hand  side  is  defined  as  the  limit,  as  r  — >  c,  of  the 
corresponding  integral  over  the  range  [b,  r],  i.e.  as  an  infinite  integral  of  the 
second  kind.  And  when  <t>{f(t)}f'(t)  has  an  infinity  at  t  =  c  the  integral  is 
essentially  an  infinite  integral.  Suppose  for  example,  that  (p{x)  =  (1  +  x)~m, 
where  1  <  m  <  2,  and  a  =  0,  and  that  f(t)  =  t/(  1  —  t).  Then  b  =  0,  c  =  1,  and 
(1)  becomes 

f°°  dx  =  f1  (1  —  t)m~2  dt]  (2) 

J0  (1  +  x)™  .7o 

and  the  integral  on  the  right-hand  side  is  an  infinite  integral  of  the  second  kind. 

On  the  other  hand  it  may  happen  that  is  continuous  for  t  =  c. 

In  this  case 

Jb 

is  a  finite  integral,  and 

lim  [  cj>{f(t)}f'(t)dt=  f  4>{f{t)}f(t)dt, 

T->c  Jb  Jb 

in  virtue  of  the  corollary  to  Theorem  (10)  of  §  160.  In  this  case  the  substitution 
x  =  f(t)  transforms  an  infinite  into  a  finite  integral.  This  case  arises  if  m  ^  2 
in  the  example  considered  a  moment  ago. 


1.  If  4>{x)  is  continuous  except  for  x  =  a,  while 

rA 


Examples  LXXVI 

<j){x)  — >  oo  as  x  — >  a,  then  the  necessary  and  sufficient  condition  that 
should  be  convergent  is  that  we  can  find  a  constant  K  such  that 

pA 


4>{x)  dx 


(j>(x)  dx  <  K 


'  a+e 


for  all  values  of  e,  however  small  (cf.  §  178). 
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It  is  clear  that  we  can  choose  a  number  A'  between  a  and  A,  such  that 
c t>(x )  is  positive  throughout  [a,  A'].  If  cj)(x )  is  positive  throughout  the  whole 
interval  [a,  A]  then  we  can  of  course  identify  A'  and  A.  Now 


rA'  rA 

4>(x)  dx  =  /  4>(x)  dx+  (j)( x)  dx. 

J  a—e  J A' 


The  first  integral  on  the  right-hand  side  of  the  above  equation  increases  as 
e  decreases,  and  therefore  tends  to  a  limit  or  to  oo;  and  the  truth  of  the  result 
stated  becomes  evident. 


If  the  condition  is  not  satisfied  then 
(•A 


4>{x)  dx  — >  oo.  We  shall  then  say 


that  the  integral  /  (j>{x)  dx  diverges  to  oo.  It  is  clear  that,  if  4>(x)  — >  oo  as 

J  a 

x  — >  a  +  0,  then  convergence  and  divergence  to  oo  are  the  only  alternatives  for 
the  integral.  We  may  discuss  similarly  the  case  in  which  (p(x)  -A  —oo. 

2.  Prove  that 

rA  {A  -  a)1-s 


\x  —  a) 


1  dx  = 


1  —  s 


if  s  <  1,  while  the  integral  is  divergent  if  s  ^  1. 

3.  If  (j)(x )  — >  oo  as  i  - )>a  +  0  and  4>(x)  <  K{x  —  a)~s,  where  s  <  1,  then 

rA 

/  (f){x)dx  is  convergent;  and  if  cj)(x )  >  K(x  —  a)  s,  where  s  ^  1,  then  the 
J  a 

integral  is  divergent.  [This  is  merely  a  particular  case  of  a  general  comparison 
theorem  analogous  to  that  stated  in  §  178.] 

4.  Are  the  integrals 


dx 


dx 


dx 


L  ^/{x  -  a){A  -  x)  ’ 
fA  dx  fA 


( A  —  x)y/x  —  a’ 
dx  fA  dx 


{A  —  x)^/ A  —  x' 
fA  dx 


J  a  yjx2  -  a2  ’  J  a 

convergent  or  divergent? 

5.  The  integrals 


^A3-*3’ 


9  9  ’ 

xz  —  az 


A3  —  x3 


"l  dx 


f‘CL-\-\ 


dx 


/- 1 


v^’  Ja-l  \Jx- a 


are  convergent,  and  the  value  of  each  is  zero. 
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6.  The  integral 


dx 


o  V Sill  X 


is  convergent.  [The  subject  of  integration  tends  to  oo  as  x  tends  to  either  limit. 
7.  The  integral 


dx 


(sinx)6 


is  convergent  if  and  only  if  s  <  1. 
8.  The  integral 


is  convergent  if  t  <  s  +  1. 
9.  Show  that 


(sin  x )' 


dx 


rh 


sun 


dx, 


Jo  xP 

where  h  >  0,  is  convergent  if  p  <  2.  Show  also  that,  if  0  <  p  <  2,  the  integrals 


smx 

xP 


r*27T 


dx, 


smi 

xP 


dx, 


rSn 


/2tt 


smx 

xP 


dx,  . . . 


alternate  in  sign  and  steadily  decrease  in  absolute  value.  [Transform  the  integral 
whose  limits  are  kir  and  (k  +  l)ir  by  the  substitution  x  =  kir  +  y .} 

10.  Show  that 


smx 


dx, 


Jo  xp 

where  0  <  p  <  2,  attains  its  greatest  value  when  h  =  tt.  (Math.  Trip.  1911.) 

1 1 .  The  integral 


'l7r 


(cos  x)l(sm  x)m  dx 


is  convergent  if  and  only  if  l  >  —  1,  m  >  —  1. 

12.  Such  an  integral  as 

r°°  Xs -1  dx 

In  1  +  X 
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where  s  <  1,  does  not  fall  directly  under  any  of  our  previous  definitions.  For 
the  range  of  integration  is  infinite  and  the  subject  of  integration  tends  to  oo  as 
x  — >  +0.  It  is  natural  to  define  this  integral  as  being  equal  to  the  sum 


"*  xs  1  dx 


1  +  x 


+ 


r°°  Xs -1  dx 

Jl  1  +  x  ’ 


provided  that  these  two  integrals  are  both  convergent. 

The  first  integral  is  a  convergent  infinite  integral  of  the  second  kind  if 
0  <  s  <  1.  The  second  is  a  convergent  infinite  integral  of  the  first  kind  if 
s  <  1.  It  should  be  noted  that  when  s  >  1  the  first  integral  is  an  ordinary 
finite  integral;  but  then  the  second  is  divergent.  Thus  the  integral  from  0  to  oo 
is  convergent  if  and  only  if  0  <  s  <  1. 

13.  Prove  that 

r °  xs~l  , 

/  1 - 7  dx 

Jo  l  +  xt 

is  convergent  if  and  only  if  0  <  s  <  t. 

14.  The  integral 


xs-i_xi~i 

- dx 

1  —  x 


is  convergent  if  and  only  if0<s<l,0<f<l.  [It  should  be  noticed  that  the 
subject  of  integration  is  undefined  when  x  =  1;  but  (x^1  —  x*^1)/ (1  —  x)  —tt  —  s 
as  x  — >  1  from  either  side;  so  that  the  subject  of  integration  becomes  a  continuous 
function  of  x  if  we  assign  to  it  the  value  t  —  s  when  x  =  1. 

It  often  happens  that  the  subject  of  integration  has  a  discontinuity  which 
is  due  simply  to  a  failure  in  its  definition  at  a  particular  point  in  the  range 
of  integration,  and  can  be  removed  by  attaching  a  particular  value  to  it  at 
that  point.  In  this  case  it  is  usual  to  suppose  the  definition  of  the  subject  of 
integration  completed  in  this  way.  Thus  the  integrals 


sm  mx 


dx, 


sm  mx 

- dx 

smx 


are  ordinary  finite  integrals,  if  the  subjects  of  integration  are  regarded  as  having 
the  value  m  when  x  =  0.] 

15.  Substitution  and  integration  by  parts.  The  formulae  for  transfor¬ 
mation  by  substitution  and  integration  by  parts  may  of  course  be  extended  to 
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infinite  integrals  of  the  second  as  well  as  of  the  first  kind.  The  reader  should 
formulate  the  general  theorems  for  himself,  on  the  lines  of  §  179. 

16.  Prove  by  integration  by  parts  that  if  s  >  0,  t  >  1,  then 

[  xs_1(l  -x)t~1dx  =  - — -  [  xs {1  —  x)1^2  dx . 

Jo  s  Jo 


17.  If  s  >  0  then 


[Put  x  =  1/1.] 

18.  If  0  <  s  <  1  then 

f 1  xs_1  +  x~s 

Jo  1  +  x 

19.  If  a  +  b  >  0  then 


b1  Xs"1  dx  _  f°°  r 

1  +  x  J i  1 


1  dt 


+  t 


dx  = 


L 


°°  t~s  dt  _  r°°  t8-1  dt 

1  + 1  Jo  1  +  t 


jf 


dx 


TV 


(x  +  a)  y/x  —  b  y/a  +  b 


(Math.  Trip.  1909.) 

20.  Show,  by  means  of  the  substitution  x  =  t/(  1  —  t),  that  if  l  and  m  are 
both  positive  then 


roa  l-l  pi 

/  ,  dx=  /  tl-\l-t)m-ldt. 

Jo  (l  +  x)l+m  Jo  K  f 


21.  Show,  by  means  of  the  substitution  x  =  pt/ (p+l  —  t),  that  if  l,  m,  and  p 
are  all  positive  then 


\m—  1 


dx 


1 


/  xl~1(l  —  xY  - 

/o  1  J  (x  +  p)l+m  (l+p)lpmJo 


tl~\l  -t)m~Ldt. 


m—1 


22.  Prove  that 
fb  dx 


y/  (x  —  a)  (b  —  x) 


=  7 r  and 


x  dx 


y/ (x  a)(b  -  x) 


=  ^7r(a  +  b ), 
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(i)  by  means  of  the  substitution  x  =  a+(b—a)t2 ,  (ii)  by  means  of  the  substitution 
(b  —  x)/(x  —  a)  =  t,  and  (iii)  by  means  of  the  substitution  x  =  a  cos2 1  +  b  sin2  t. 
23.  If  s  >  —  1  then 


(sin  Q)sd6  = 


xs  dx 
V7!  —  x2 


x^s  ^  dx 
V7!  —  x 


24.  Establish  the  formulae 
f1  f(x)  dx  f 


1 0  Vl  -  x2  Jo 
fb  f(x)  dx 


/(sin#)  dO , 


=  2 


y/(x  —  a)(b  -  x)  Jo 
f 


f(a  cos2  0  +  b  sin2  9)  dO, 


a  —  x 
a  +  x 


dx  =  4a  /  /(tan  6)  cos  6  sin  9  d9. 

Jo 


25.  Prove  that 


dx 


lo  (1  +  x){2  +  x)i/x(l  -  x) 


=  7 r 


[Put  x  =  sin2  9  and  use  Ex.  lxiii.  8.] 


(Math.  Trip.  1912.) 


182.  Some  care  has  occasionally  to  be  exercised  in  applying  the  rule  for 
transformation  by  substitution.  The  following  example  affords  a  good  illustra¬ 
tion  of  this. 

Let 

J  =  j  (x2  —  6x  +  13)  dx. 

We  find  by  direct  integration  that  J  =  48.  Now  let  us  apply  the  substitution 


y  =  xr  —  6x  +  13, 


which  gives  x  =  3  ±  \Jy  —  4.  Since  y  =  8  when  x  =  1  and  y  =  20  when  x  =  7, 
we  appear  to  be  led  to  the  result 
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The  indefinite  integral  is 

i(»-4)s/2  +  4(!/-4)V2, 

and  so  we  obtain  the  value  which  is  certainly  wrong  whichever  sign  we 

choose. 

The  explanation  is  to  be  found  in  a  closer  consideration  of  the  relation  be¬ 
tween  x  and  y.  The  function  x 2  —  6.x  +  13  has  a  minimum  for  x  =  3,  when  y  =  4. 
As  x  increases  from  1  to  3 ,  y  decreases  from  8  to  4,  and  dx/dy  is  negative,  so 
that 

dx  1 

dy  2 y/y  -  4' 

As  x  increases  from  3  to  7 ,  y  increases  from  4  to  20,  and  the  other  sign  must  be 
chosen.  Thus 

J=  f  ydx=  f  { - -L=\dy+  [  V  dy, 

J  i  J si  2,^4]  i4  2^y^4  y 

a  formula  which  will  be  found  to  lead  to  the  correct  result. 

/»7T 

Similarly,  if  we  transform  the  integral  /  dx  =  n  by  the  substitution 

Jo _ 

x  =  arc  sin  y,  we  must  observe  that  dx/dy  =  1/  i/l  —  y2  or  dx/dy  =  —  1/  \/l  —  y2 
according  as  0  ^  x  <  ^7r  or  4^  <  x  ^  7r. 

Example.  Verify  the  results  of  transforming  the  integrals 

/*1  /*7T 

/  (4x2  —  x  +  A)  dx,  /  cos 2xdx 

Jo  Jo 

by  the  substitutions  4x2  —  x  +  A=y,  x  =  arc  sin  y  respectively. 

183.  Series  of  positive  and  negative  terms.  Our  definitions  of 
the  sum  of  an  infinite  series,  and  the  value  of  an  infinite  integral,  whether  of 
the  first  or  the  second  kind,  apply  to  series  of  terms  or  integrals  of  functions 
whose  values  may  be  either  positive  or  negative.  But  the  special  tests  for 
convergence  or  divergence  which  we  have  established  in  this  chapter,  and 
the  examples  by  which  we  have  illustrated  them,  have  had  reference  almost 
entirely  to  the  case  in  which  all  these  values  are  positive.  Of  course  the 
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case  in  which  they  are  all  negative  is  not  essentially  different,  as  it  can  be 
reduced  to  the  former  by  changing  un  into  —un  or  <f>(x)  into 

In  the  case  of  a  series  it  has  always  been  explicitly  or  tacitly  assumed 
that  any  conditions  imposed  upon  un  may  be  violated  for  a  finite  number  of 
terms:  all  that  is  necessary  is  that  such  a  condition  {e.g.  that  all  the  terms 
are  positive)  should  be  satisfied  from  some  definite  term  onwards.  Similarly 
in  the  case  of  an  infinite  integral  the  conditions  have  been  supposed  to  be 
satisfied  for  all  values  of  x  greater  than  some  definite  value ,  or  for  all  values 
of  x  within  some  definite  interval  [a,  a  +  <5]  which  includes  the  value  a  near 
which  the  subject  of  integration  tends  to  infinity.  Thus  our  tests  apply  to 
such  a  series  as 

n2  —  10 

n4  ’ 

since  n2  —  10  >  0  when  n  if  4,  and  to  such  integrals  as 

3x  —  7  ,  f1  1  -  2a; 

7 — rwry  dx,  /  — -=-  dx, 

[x  + 1)3  Jo  \fx 

since  3a;  —  7  >  0  when  x  >  and  1  —  2x  >  0  when  0  <  x  < 

But  when  the  changes  of  sign  of  un  persist  throughout  the  series, 
i.e.  when  the  number  of  both  positive  and  negative  terms  is  infinite,  as  in 
the  series  1  —  \  +  \  —  |  +  . . . ;  or  when  J>{x)  continually  changes  sign  as 
x  — V  oc,  as  in  the  integral 

sin  a; 

- dx, 

Xs 

or  as  x  — *  a,  where  a  is  a  point  of  discontinuity  of  <f>(x),  as  in  the  integral 


then  the  problem  of  discussing  convergence  or  divergence  becomes  more 
difficult.  For  now  we  have  to  consider  the  possibility  of  oscillation  as  well 
as  of  convergence  or  divergence. 

We  shall  not,  in  this  volume,  have  to  consider  the  more  general  problem 
for  integrals.  But  we  shall,  in  the  ensuing  chapters,  have  to  consider  certain 
simple  examples  of  series  containing  an  infinite  number  of  both  positive  and 
negative  terms. 
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184.  Absolutely  Convergent  Series.  Let  us  then  consider  a  series 
Y  un  in  which  any  term  may  be  either  positive  or  negative.  Let 

|^n| 

so  that  an  =  un  if  un  is  positive  and  an  =  —  un  if  un  is  negative.  Further,  let 
vn  =  un  or  vn  =  0,  according  as  un  is  positive  or  negative,  and  wn  =  —un 
or  wn  =  0,  according  as  un  is  negative  or  positive;  or,  what  is  the  same 
thing,  let  vn  or  wn  be  equal  to  an  according  as  un  is  positive  or  negative, 
the  other  being  in  either  case  equal  to  zero.  Then  it  is  evident  that  vn 
and  wn  are  always  positive,  and  that 

Un  Vn  W m  Oln  Vn  +  VJn. 


If,  for  example,  our  series  is  1  —  (1/2)2  +  (1/3)2  —  . . . ,  then  un  =  (— 1  )n~1/n2 
and  an  =  1/n2,  while  vn  =  1/n2  or  vn  =  0  according  as  n  is  odd  or  even  and 
wn  =  1/n2  or  wn  =  0  according  as  n  is  even  or  odd. 

We  can  now  distinguish  two  cases. 

A.  Suppose  that  the  series  Y2an  is  convergent.  This  is  the  case,  for 
instance,  in  the  example  above,  where  Y2  an  is 

1  +  (1/2)2  +  (1/3)2  +  . . . . 

Then  both  Y2  vn  and  Y2  wn  are  convergent:  for  (Ex.  XXX.  18)  any  series  se¬ 
lected  from  the  terms  of  a  convergent  series  of  positive  terms  is  convergent. 
And  hence,  by  theorem  (6)  of  §  77,  Y2  un  or  Y2(vn  ~  wn )  is  convergent  and 
equal  to  Y2vn~Y2wn- 

We  are  thus  led  to  formulate  the  following  definition. 

Definition.  When  Y2an  orYl  \un\  is  convergent,  the  series  Y2un 
said  to  be  absolutely  convergent. 

And  what  we  have  proved  above  amounts  to  this:  if  «n  is  absolutely 
convergent  then  it  is  convergent;  so  are  the  series  formed  by  its  positive 
and  negative  terms  taken  separately;  and  the  sum  of  the  series  is  equal  to 
the  sum  of  the  positive  terms  plus  the  sum  of  the  negative  terms. 
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The  reader  should  carefully  guard  himself  against  supposing  that  the  state¬ 
ment  ‘an  absolutely  convergent  series  is  convergent’  is  a  mere  tautology.  When 
we  say  that  Y  un  is  ‘absolutely  convergent’  we  do  not  assert  directly  that  Y  un  is 
convergent:  we  assert  the  convergence  of  another  series  Y  \un\,  and  it  is  by  no 
means  evident  a  priori  that  this  precludes  oscillation  on  the  part  of  Yun- 


Examples  LXXVII.  1.  Employ  the  ‘general  principle  of  convergence’ 
(§  84)  to  prove  the  theorem  that  an  absolutely  convergent  series  is  convergent. 
[Since  Y  \un\  is  convergent,  we  can,  when  any  positive  number  e  is  assigned, 
choose  no  so  that 

l^ni+ll  +  |Wni+2|  +  •  •  •  +  \un2\  <  € 
when  n2  >  n±  ^  n$.  A  fortiori 


Uni+l  +  ^rti+2  +  •  •  ■  +  Un2  \  <  6, 


and  therefore  Y  un  is  convergent.] 

2.  If  Y  an  is  a  convergent  series  of  positive  terms,  and  \bn\  ^  Kan,  then 

Y  bn  is  absolutely  convergent. 

3.  If  Y  an  is  a  convergent  series  of  positive  terms,  then  the  series  Y  anxn 
is  absolutely  convergent  when  —  1  ^  x  ^  1. 

4.  If  Y  an  is  a  convergent  series  of  positive  terms,  then  the  series 

Y  an  cos  n8,  Y  o-n  sin  nd  are  absolutely  convergent  for  all  values  of  6.  [Ex¬ 
amples  are  afforded  by  the  series  Y  7-71  cos  n6,  Y  rn  sin  n6  of  §  88.] 

5.  Any  series  selected  from  the  terms  of  an  absolutely  convergent  series  is 
absolutely  convergent.  [For  the  series  of  the  moduli  of  its  terms  is  a  selection 
from  the  series  of  the  moduli  of  the  terms  of  the  original  series.] 

6.  Prove  that  if  | |  is  convergent  then 

=  Y'j  I  a 7),  | , 

and  that  the  only  case  to  which  the  sign  of  equality  can  apply  is  that  in  which 
every  term  has  the  same  sign. 
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185.  Extension  of  Dirichlet’s  Theorem  to  absolutely  conver¬ 
gent  series.  Dirichlet’s  Theorem  (§  169)  shows  that  the  terms  of  a  se¬ 
ries  of  positive  terms  may  be  rearranged  in  any  way  without  affecting  its 
sum.  It  is  now  easy  to  see  that  any  absolutely  convergent  series  has  the 
same  property.  For  let  22  un  be  so  rearranged  as  to  become  22  u2  and  let 
a'n,  v'n,  w'n  be  formed  from  u'n  as  an,  vn,  wn  were  formed  from  un.  Then 
22  an  Is  convergent,  as  it  is  a  rearrangement  of  22  am  and  so  are  22v'm 
22  w2  which  are  rearrangements  of  22  vn,  22  Also,  by  Dirichlet’s  The¬ 
orem,  22  vn  =  22  vn  and  22  wn  =  22  wn  and  so 

22^'n  =  22^'n  ~  22™'n  =  J2^n  ~  22™n  =  22^n- 

186.  Conditionally  convergent  series.  B.  We  have  now  to  con¬ 
sider  the  second  case  indicated  above,  viz.  that  in  which  the  series  of  moduli 
22  an  diverges  to  oo. 

Definition.  If22un  w  convergent,  but  22  \un\  divergent,  the  original 
series  is  said  to  be  conditionally  convergent. 

In  the  first  place  we  note  that,  if  22  un  Is  conditionally  convergent,  then 
the  series  22  vn,  22  wn  of  §  184  must  both  diverge  to  oo.  For  they  obviously 
cannot  both  converge,  as  this  would  involve  the  convergence  of  22(vnJrwn) 
or  22  an ■  And  if  one  of  them,  say  22  wn-  is  convergent,  and  22  vn  divergent, 
then 

N  N  N 

Un  =  5Z Vn  ~  Wn’ 

0  0  0 

and  therefore  tends  to  oo  with  N,  which  is  contrary  to  the  hypothesis  that 
22  un  is  convergent. 

Hence  22vm  22  wn  are  both  divergent.  It  is  clear  from  equation  (1) 
above  that  the  sum  of  a  conditionally  convergent  series  is  the  limit  of 
the  difference  of  two  functions  each  of  which  tends  to  oo  with  n.  It  is 
obvious  too  that  22  un  no  longer  possesses  the  property  of  convergent  se¬ 
ries  of  positive  terms  (Ex.  xxx.  18),  and  all  absolutely  convergent  series 
(Ex.  LXXVII.  5),  that  any  selection  from  the  terms  itself  forms  a  conver¬ 
gent  series.  And  it  seems  more  than  likely  that  the  property  prescribed 
by  Dirichlet’s  Theorem  will  not  be  possessed  by  conditionally  convergent 
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series;  at  any  rate  the  proof  of  §  185  fails  completely,  as  it  depended  essen¬ 
tially  on  the  convergence  of  Y  vn  and  Y  wn  separately.  We  shall  see  in  a 
moment  that  this  conjecture  is  well  founded,  and  that  the  theorem  is  not 
true  for  series  such  as  we  are  now  considering. 

187.  Tests  of  convergence  for  conditionally  convergent  series. 

It  is  not  to  be  expected  that  we  should  be  able  to  find  tests  for  conditional 
convergence  as  simple  and  general  as  those  of  §§  167  et  seq.  It  is  naturally 
a  much  more  difficult  matter  to  formulate  tests  of  convergence  for  series 
whose  convergence,  as  is  shown  by  equation  (1)  above,  depends  essentially 
on  the  cancelling  of  the  positive  by  the  negative  terms.  In  the  first  instance 
there  are  no  comparison  tests  for  convergence  of  conditionally  convergent 
series. 

For  suppose  we  wish  to  infer  the  convergence  of  Y  vn  from  that  of  Y  un. 
We  have  to  compare 


v0  +  vi-\ - b  un,  u0  +  Ui  4 - h«„. 

If  every  u  and  every  v  were  positive,  and  every  v  less  than  the  correspond¬ 
ing  u,  we  could  at  once  infer  that 


Vo  +  Vi  4 - b  vn  <  u0  4 - b  un, 

and  so  that  Y  vn  is  convergent.  If  the  it’s  only  were  positive  and  every  v 
numerically  less  than  the  corresponding  u,  we  could  infer  that 

|u0|  +  M  4 - b  \vn\  <  u0  4 - b  un, 

and  so  that  Yvn  is  absolutely  convergent.  But  in  the  general  case,  when 
the  u’s  and  v’s  are  both  unrestricted  as  to  sign,  all  that  we  can  infer  is  that 

\v0\  +  |ui|  4 - b  \vn |  <  \u0\  4 - b  \un\. 

This  would  enable  us  to  infer  the  absolute  convergence  of  Yvn  from  the 
absolute  convergence  of  Yun]  but  if  Y  un  is  only  conditionally  convergent 
we  can  draw  no  inference  at  all. 
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Example.  We  shall  see  shortly  that  the  series  1— 5+5  —  \+-  ■  ■  is  convergent . 
But  the  series  j  +  ^  +  j  +  gT---  is  divergent,  although  each  of  its  terms  is 
numerically  less  than  the  corresponding  term  of  the  former  series. 

It  is  therefore  only  natural  that  such  tests  as  we  can  obtain  should  be 
of  a  much  more  special  character  than  those  given  in  the  early  part  of  this 
chapter. 


188.  Alternating  Series.  The  simplest  and  most  common  condi¬ 
tionally  convergent  series  are  what  is  known  as  alternating  series,  series 
whose  terms  are  alternately  positive  and  negative.  The  convergence  of  the 
most  important  series  of  this  type  is  established  by  the  following  theorem. 

If  <f)(n )  is  a  positive  function  of  n  which  tends  steadily  to  zero  as 
n  — »  00,  then  the  series 


m  -0(1)  +  0(2)-... 

is  convergent,  and  its  sum  lies  between  0(0)  and  0(0)  —  0(1). 
Let  us  write  0o,  0i,  ...  for  0(0),  0(1),  . . . ;  and  let 

sn  =  0o  —  01  +  02  —  •  •  •  +  (  —  l)ra0n- 


Then 


S2n+1  ~  s'2n-\  ~  02 n  ~  02n+l  =  0?  S2 n  ~  S2n-2  ~  ~~  (02n-l  —  02 n)  =  0. 

Hence  s0,  s2,  s4,  •  •  • ,  s2n,  •  •  •  is  a  decreasing  sequence,  and  therefore 
tends  to  a  limit  or  to  —  oo,  and  .Si ,  S3,  S5,  ...,  s2n+i,  •••  is  an 
increasing  sequence,  and  therefore  tends  to  a  limit  or  to  00.  But 
lim(s2n+i  —  s2n)  =  lim(— l)2n+102n+i  =  0,  from  which  it  follows  that  both 
sequences  must  tend  to  limits,  and  that  the  two  limits  must  be  the  same. 
That  is  to  say,  the  sequence  s0,  si;  . . . ,  sn,  ...  tends  to  a  limit.  Since 
s0  =  0o,  si  =  0o  —  0i,  it  is  clear  that  this  limit  lies  between  0O  and  0o  —  0i- 
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Examples  LXXVIII.  1.  The  series 

111  111 

1“2  +  3_4  +  -"’  +  + 

V  (-i)n  v  (-i)n  v  (-i)n  v  (-i)n 
^(n  +  a)’  ^  yfr+Z'  +  Z-(V^  +  ^)2’ 

where  a  >  0,  are  conditionally  convergent. 

2.  The  series  l)n(n  +  a)~s,  where  a  >  0,  is  absolutely  convergent  if 
s  >  1,  conditionally  convergent  if  0  <  s  E  1,  and  oscillatory  if  s  E  0. 

3.  The  sum  of  the  series  of  §  188  lies  between  sn  and  Sn+i  for  all  values 
of  n;  and  the  error  committed  by  taking  the  sum  of  the  first  n  terms  instead  of 
the  sum  of  the  whole  series  is  numerically  not  greater  than  the  modulus  of  the 
(n  +  l)th  term. 

4.  Consider  the  series 


E 


(-1)" 

y/n+{- l)n’ 


which  we  suppose  to  begin  with  the  term  for  which  n  =  2,  to  avoid  any  difficulty 
as  to  the  definitions  of  the  first  few  terms.  This  series  may  be  written  in  the 
form 

v  17  (~1)n  (~1)n]  (~1)n~ 

^  \Vn+(—l)n  yjn  J  yjn 
or 

~  n  +  (-l)V«}  = 

say.  The  series  is  convergent;  but  Xn  is  divergent,  as  all  its  terms  are 

positive,  and  limnxn  =  1-  Hence  the  original  series  is  divergent,  although  it  is 
of  the  form  <f>2  —  4>3  +  </>4  —  . . . ,  where  4>n  — >  0.  This  example  shows  that  the 
condition  that  cf)n  should  tend  steadily  to  zero  is  essential  to  the  truth  of  the 
theorem.  The  reader  will  easily  verify  that  y/2n  +  1  —  1  <  y/2n  +  1,  so  that  this 
condition  is  not  satisfied. 

5.  If  the  conditions  of  §  188  are  satisfied  except  that  4>n  tends  steadily  to  a 
positive  limit  l,  then  the  series  1  )n4>n  oscillates  finitely. 

6.  Alteration  of  the  sum  of  a  conditionally  convergent  series  by 
rearrangement  of  the  terms.  Let  s  be  the  sum  of  the  series  1  — 1  +  ^  — 1  +  ..., 
and  S2n  the  sum  of  its  first  2 n  terms,  so  that  lims2n  =  s. 
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Now  consider  the  series 


'  3  2  '  5  '  7  4' 


(i) 


in  which  two  positive  terms  are  followed  by  one  negative  term,  and  let  t^n  denote 
the  sum  of  the  first  3 n  terms.  Then 


t3n  —  1  +  ^  + 
=  S2n  + 


•  + 


1 


1 


Now 


linr 


1 


2n+  1 
1 


4n  —  1 

1 

+ 


1 

4 


1 

2  n 


+ 


2n  +  3 
1 


+  •••  + 


1 


4n  —  1 


- h 


1 


4n  —  1 


1 

4n 


=  0, 


2n  +  1  2n  +  2  2n  +  3 

since  the  sum  of  the  terms  inside  the  bracket  is  clearly  less  than 
n/(2n  +  l)(2n  +  2);  and 

r2  dx 

I  =  4  Inn  —  >  -  =  4  I 

An 


lim 


1 


+ 


1 


2n  +  2  '  2n  +  4 
by  §§  156  and  158.  Hence 


+ 


An  J  2  n 


^  1  +  (r/n)  J  i 


x 


v  i  /  2  dx 

linr  f3ri  =  s  +  2  /  — , 

'  Ji  x 


and  it  follows  that  the  sum  of  the  series  (1)  is  not  s,  but  the  right-hand  side  of 
the  last  equation.  Later  on  we  shall  give  the  actual  values  of  the  sums  of  the 
two  series:  see  §  213  and  Ch.  IX,  Misc.  Ex.  19. 

It  can  indeed  be  proved  that  a  conditionally  convergent  series  can  always  be 
so  rearranged  as  to  converge  to  any  sum  whatever,  or  to  diverge  to  oo  or  to  —  oo. 
For  a  proof  we  may  refer  to  Bromwich’s  Infinite  Series ,  p.  68. 

7.  The  series 

11111 


1  -  1  1  1 

1  H — ^ - ^  H — ^  H — ^ - + 

\/3  v/2  x/5  \/7  \/4 


diverges  to  oo.  [Here 


tin  —  S2 n  V 


1 


+ 


\j2n  +  1  \j2n  +  3 


+  •••  + 


\/An  —  1 


>  S2n  + 


n 


yj  An  —  1  ’ 


,  1  1  1 

where  s2n  =  1 - ^  A - - 


,  which  tends  to  a  limit  as  n  — >  oo.[ 
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189.  Abel’s  and  Dirichlet’s  Tests  of  Convergence.  A  more  gen¬ 
eral  test,  which  includes  the  test  of  §  188  as  a  particular  test  case,  is  the  following. 

Dirichlet’s  Test.  If  <fn  satisfies  the  same  conditions  as  in  §  188,  and 
Y  an  is  any  series  which  converges  or  oscillates  finitely,  then  the  series 

aofo  +  ai<£i  +  02^2  +  ■  ■  • 

is  convergent. 

The  reader  will  easily  verify  the  identity 
O0</)0  +  ®1</,1  +  '  ■  • ~\~an(f)n  =  •So(</)0  —  +  —  ^>2)  +  '  ■  •  +  Sn_l((^n_l  —  (t>n)~\~  Snfin-I 

where  sn  =  ao  +  a\  +  ■  ■  ■  +  an.  Now  the  series  (</>o  —  fii)  +  {4>  1  —  fifi)  +  . . .  is 
convergent,  since  the  sum  to  n  terms  is  fio  —  and  lirri  <pn  =  0;  and  all  its  terms 
are  positive.  Also  since  Yam  if  not  actually  convergent,  at  any  rate  oscillates 
finitely,  we  can  determine  a  constant  K  so  that  |sj,|  <  K  for  all  values  of  v. 
Hence  the  series 

—  4>v+ 1 ) 

is  absolutely  convergent,  and  so 

so(4>0  ~  <^l)  +  Sl(<f>l  ~  ^2)  +  •  •  •  +  Sn-i((j)n-i  —  (j)n) 

tends  to  a  limit  as  n  — >  00.  Also  <pn ,  and  therefore  snfin,  tends  to  zero.  And 
therefore 

aofio  +  ai<f>i  +  •  •  •  +  an(j)n 

tends  to  a  limit,  i.e.  the  series  is  convergent. 

Abel’s  Test.  There  is  another  test,  due  to  Abel,  which,  though  of  less 
frequent  application  than  Dirichlet’s,  is  sometimes  useful. 

Suppose  that  <j>n,  as  in  Dirichlet’s  Test,  is  a  positive  and  decreasing  function 
of  n,  but  that  its  limit  as  n  — >  00  is  not  necessarily  zero.  Thus  we  postulate 
less  about  4>n,  but  to  make  up  for  this  we  postulate  more  about  Y  a« >  yiz.  that 
it  is  convergent.  Then  we  have  the  theorem:  if  f>n  is  a  positive  and  decreasing 
function  of  n,  and  Yan  is  convergent,  then  Yanfin  is  convergent. 

For  (fn  has  a  limit  as  n  — >  00,  say  l :  and  lim(<5n  —  l)  =  0.  Hence,  by 
Dirichlet’s  Test,  Y  an{fin  ~  0  is  convergent;  and  as  Y  an  is  convergent  it  follows 
that  Y  anf>n  is  convergent. 

This  theorem  may  be  stated  as  follows:  a  convergent  series  remains  conver¬ 
gent  if  we  multiply  its  terms  by  any  sequence  of  positive  and  decreasing  factors. 
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Examples  LXXIX.  1.  Dirichlet’s  and  Abel’s  Tests  may  also  be  estab¬ 
lished  by  means  of  the  general  principle  of  convergence  (§  84).  Let  us  suppose, 
for  example,  that  the  conditions  of  Abel’s  Test  are  satisfied.  We  have  identically 

®m^m  T  Qm+1 0771+1  V  '  '  '  +  ®n0n  =  Sm!m(0m  0m+ 1)  T  Sm,m+ l(0777+l  0777+ 2) 

T  ■  ‘  ‘  T  Sm,n—1  (0n—l  07 7)  T  Srrijntftn  ■  ■  ■  >  (1) 


where 

S  171,1;  =  +  &777+1  +  •  •  •  +  Ujy- 

The  left-hand  side  of  (1)  therefore  lies  between  h(j)m  and  H(j)m,  where  h  and  H 
are  the  algebraically  least  and  greatest  of  sm)Tn,  sm<rn+ 1,  . . . ,  sm,n.  But,  given 
any  positive  number  e,  we  can  choose  mo  so  that  |sTO:„|  <  e  when  m  A  mo,  and 
so 

1^7770771  +  0777+10777+1  +  •  •  •  +  ^77077  1  <  £0777  =  £01 

when  n  >  m  ^  mo-  Thus  the  series  Y  an0n  is  convergent. 

2.  The  series  Ycosn@  and  ^sin  nd  oscillate  hnitely  when  9  is  not  a  mul¬ 
tiple  of  7 r.  For,  if  we  denote  the  sums  of  the  first  n  terms  of  the  two  series  by  sn 
and  tn,  and  write  2  =  Cis0,  so  that  \z\  =  1  and  z  0  1,  we  have 

1  -  zn  ^  1  +  \zn\  ^  2 

1  -  z  =  |1  -  z\  =  1 1  —  z\  ’ 

and  so  |sn|  and  |in|  are  also  not  greater  than  2/ 1 1  —  z  |.  That  the  series  are  not 
actually  convergent  follows  from  the  fact  that  their  nth  terms  do  not  tend  to 
zero  (Exs.  XXIV.  7,  8). 

The  sine  series  converges  to  zero  if  9  is  a  multiple  of  7r.  The  cosine  series 
oscillates  hnitely  if  9  is  an  odd  multiple  of  7r  and  diverges  if  9  is  an  even  multiple 
of  7T. 

It  follows  that  if  9n  is  a  positive  function  of  n  which  tends  steadily  to  zero 
as  n  — »  oo,  then  the  series 


I £*77  T  itn\  — 


Y&n  cos  nQ,  Yfin  sin  nO 

are  convergent,  except  perhaps  the  first  series  when  9  is  a  multiple  of  27r.  In 
this  case  the  first  series  reduces  to  Y^m  which  may  or  may  not  be  convergent: 
the  second  series  vanishes  identically.  If  Y  077  is  convergent  then  both  series 
are  absolutely  convergent  (Ex.  lxxvii.  4)  for  all  values  of  9,  and  the  whole 
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interest  of  the  result  lies  in  its  application  to  the  case  in  which  cj>n  is  divergent. 
And  in  this  case  the  series  above  written  are  conditionally  and  not  absolutely 
convergent,  as  will  be  proved  in  Ex.  LXXIX.  6.  If  we  put  8  =  it  in  the  cosine 
series  we  are  led  back  to  the  result  of  §  188,  since  cosn7r  =  (—1)"  . 

3.  The  series  Y2  n~s  cos  n8,  ]Cn-ssinn(9  are  convergent  if  s  >  0,  unless  (in 
the  case  of  the  first  series)  8  is  a  multiple  of  2-7T  and  0  <  s  Y  1. 

4.  The  series  of  Ex.  3  are  in  general  absolutely  convergent  if  s  >  1,  condi¬ 
tionally  convergent  if  0  <  s  Y  1,  and  oscillatory  if  s  Y  0  (finitely  if  s  =  0  and 
infinitely  if  s  <  0).  Mention  any  exceptional  cases. 

5.  If  Y2  CLnn~s  is  convergent  or  oscillates  finitely,  then  Y2  annt  is  convergent 
when  t  >  s. 

6.  If  4>n  is  a  positive  function  of  n  which  tends  steadily  to  0  as  n  — >■  oo,  and 
J2  fin  is  divergent,  then  the  series  Y2  cos  n8,  ^ ~^(j)ns\nnd  are  not  absolutely 
convergent,  except  the  sine-series  when  8  is  a  multiple  of  it.  [For  suppose, 
e.g.,  that  rf’n]  cosnd\  is  convergent.  Since  cos 2  n8  Y  |cosn#|,  it  follows  that 
Y2  4>n  cos2  nd  or 

\Y2,(t)n{  1  +  cos  2  nd) 

is  convergent.  But  this  is  impossible,  since  Yh&n  is  divergent  and  Y2,  cos  2nd , 
by  Dirichlet’s  Test,  convergent,  unless  8  is  a  multiple  of  n.  And  in  this  case 
it  is  obvious  that  ^)</>n|cosn0|  is  divergent.  The  reader  should  write  out  the 
corresponding  argument  for  the  sine-series,  noting  where  it  fails  when  8  is  a 
multiple  of  7r.] 

190.  Series  of  complex  terms.  So  far  we  have  confined  ourselves 
to  series  all  of  whose  terms  are  real.  We  shall  now  consider  the  series 

XX  =  XX  +  iwn), 

where  vn  and  wn  are  real.  The  consideration  of  such  series  does  not,  of 
course,  introduce  anything  really  novel.  The  series  is  convergent  if,  and 
only  if,  the  series 

XW,  XWn 

are  separately  convergent.  There  is  however  one  class  of  such  series  so  im¬ 
portant  as  to  require  special  treatment.  Accordingly  we  give  the  following 
definition,  which  is  an  obvious  extension  of  that  of  §  184. 
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Definition.  The  series  Yun,  where  un  =  vn+iwn,  is  said  to  be  abso¬ 
lutely  convergent  if  the  series  Y  vn  and  Y  wn  are  absolutely  convergent. 

Theorem.  The  necessary  and  sufficient  condition  for  the  absolute  con¬ 
vergence  ofYun  ts  the  convergence  ofY  \un\  °r  Y  \J vn  +  wn ■ 

For  if  Yun  is  absolutely  convergent,  then  both  of  the  series  Y  \vn\, 
Y  \wn\  are  convergent,  and  so  ^{|nn|  +  wn | }  is  convergent:  but 

\Un\  =  VVn  +  Wl  =  K|  +  \wn\, 

and  therefore  Y  \un\  is  convergent.  On  the  other  hand 
M  ^  \/v2n  +  w2n,  \wn\  ^  \/v2n  +  w2n, 

so  that  Y  \vn\  and  Y  \wn\  are  convergent  whenever  Y  \un\  is  convergent. 

It  is  obvious  that  an  absolutely  convergent  series  is  convergent,  since 
its  real  and  imaginary  parts  converge  separately.  And  Dirichlet’s  Theorem 
(§§  169,  185)  may  be  extended  at  once  to  absolutely  convergent  complex 
series  by  applying  it  to  the  separate  series  Y  vn  and  Y  wn- 

The  convergence  of  an  absolutely  convergent  series  may  also  be  deduced 
directly  from  the  general  principle  of  convergence  (cf.  Ex.  LXXVli.  1).  We  leave 
this  as  an  exercise  to  the  reader. 

191.  Power  Series.  One  of  the  most  important  parts  of  the  theory  of 
the  ordinary  functions  which  occur  in  elementary  analysis  (such  as  the  sine 
and  cosine,  and  the  logarithm  and  exponential,  which  will  be  discussed  in 
the  next  chapter)  is  that  which  is  concerned  with  their  expansion  in  series 
of  the  form  Y  anxn.  Such  a  series  is  called  a  power  series  in  x.  We 
have  already  come  across  some  cases  of  expansion  in  series  of  this  kind  in 
connection  with  Taylor’s  and  Maclaurin’s  series  (§  148).  There,  however, 
we  were  concerned  only  with  a  real  variable  x.  We  shall  now  consider  a 
few  general  properties  of  power  series  in  z,  where  z  is  a  complex  variable. 

A.  A  power  series  Y  anZn  may  be  convergent  for  all  values  of  z,  for  a 
certain  region  of  values,  or  for  no  values  except  z  =  0. 

It  is  sufficient  to  give  an  example  of  each  possibility. 
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1.  The  series  ^  — -  is  convergent  for  all  values  of  z.  For  if  un  =  —  then 


\un+i\/\un\  =  \z\/(n  +  1)  ->•  0 


as  n  — >  oo,  whatever  value  z  may  have.  Hence,  by  d’Alembert’s  Test,  ^2  \un\  is 
convergent  for  all  values  of  z,  and  the  original  series  is  absolutely  convergent  for 
all  values  of  z.  We  shall  see  later  on  that  a  power  series,  when  convergent,  is 
generally  absolutely  convergent. 

2.  The  series  ^  n!  is  not  convergent  for  any  value  of  z  except  z  =  0. 
For  if  un  =  n\  zn  then  |un+i|/|wn|  =  (n  +  l)|z|,  which  tends  to  oo  with  n,  unless 
z  =  0.  Hence  (cf.  Exs.  xxvil.  1,  2,  5)  the  modulus  of  the  nth  term  tends  to  oo 
with  n;  and  so  the  series  cannot  converge,  except  when  z  =  0.  It  is  obvious  that 
any  power  series  converges  when  z  =  0. 

3.  The  series  Y2  zn  always  convergent  when  \z\  <  1,  and  never  convergent 
when  \z\  If  1.  This  was  proved  in  §  88.  Thus  we  have  an  actual  example  of  each 
of  the  three  possibilities. 

192.  B.  If  a  power  series  ^  anzn  is  convergent  for  a  particular  value 
of  z,  say  z\  =  ri(cos#i  +  isin^i),  then  it  is  absolutely  convergent  for  all 
values  of  z  such  that  \z\  <  r\. 

For  liin anzf  =  0,  since  anZi  is  convergent,  and  therefore  we  can 
certainly  find  a  constant  K  such  that  |anz”  |  <  K  for  all  values  of  n.  But, 
if  |o|  =  r  <  7T,  we  have 


and  the  result  follows  at  once  by  comparison  with  the  convergent  geomet¬ 
rical  series  ^(r/ri)n. 

In  other  words,  if  the  series  converges  at  P  then  it  converges  absolutely 
at  all  points  nearer  to  the  origin  than  P. 

Example.  Show  that  the  result  is  true  even  if  the  series  oscillates  finitely 
when  z  =  z\.  [If  sn  =  ao  +  oi^i  +  •  •  •  +anzf  then  we  can  find  K  so  that  |sn|  <  K 
for  all  values  of  n.  But  |an2;”|  =  |sn  —  sn_i|  ^  |sn_i|  +  |sn|  <  2K,  and  the 
argument  can  be  completed  as  before.] 
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193.  The  region  of  convergence  of  a  power  series.  The  circle 
of  convergence.  Let  z  =  r  be  any  point  on  the  positive  real  axis.  If 
the  power  series  converges  when  z  =  r  then  it  converges  absolutely  at  all 
points  inside  the  circle  \z\  —  r.  In  particular  it  converges  for  all  real  values 
of  z  less  than  r. 

Now  let  us  divide  the  points  r  of  the  positive  real  axis  into  two  classes, 
the  class  at  which  the  series  converges  and  the  class  at  which  it  does  not. 
The  first  class  must  contain  at  least  the  one  point  z  —  0.  The  second  class, 
on  the  other  hand,  need  not  exist,  as  the  series  may  converge  for  all  values 
of  0.  Suppose  however  that  it  does  exist,  and  that  the  first  class  of  points 
does  include  points  besides  z  =  0.  Then  it  is  clear  that  every  point  of  the 
first  class  lies  to  the  left  of  every  point  of  the  second  class.  Hence  there  is 
a  point,  say  the  point  z  =  R,  which  divides  the  two  classes,  and  may  itself 
belong  to  either  one  or  the  other.  Then  the  series  is  absolutely  convergent 
at  all  points  inside  the  circle  \z\  =  R. 

For  let  P  be  any  such  point.  We  can  draw  a  circle,  whose  centre  is  O 
and  whose  radius  is  less  than  R ,  so  as  to  include  P  inside  it.  Let  this 


O 


X 


Fig.  51. 


circle  cut  OA  in  Q.  Then  the  series  is  convergent  at  Q,  and  therefore,  by 
Theorem  B,  absolutely  convergent  at  P. 

On  the  other  hand  the  series  cannot  converge  at  any  point  P'  outside 
the  circle.  For  if  it  converged  at  P'  it  would  converge  absolutely  at  all 
points  nearer  to  O  than  P;  and  this  is  absurd,  as  it  does  not  converge  at 
any  point  between  A  and  Q'  (Fig.  51). 
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So  far  we  have  excepted  the  cases  in  which  the  power  series  (1)  does  not 
converge  at  any  point  on  the  positive  real  axis  except  z  =  0  or  (2)  converges 
at  all  points  on  the  positive  real  axis.  It  is  clear  that  in  case  (1)  the  power 
series  converges  nowhere  except  when  z  =  0,  and  that  in  case  (2)  it  is 
absolutely  convergent  everywhere.  Thus  we  obtain  the  following  result:  a 
power  series  either 

(1)  converges  for  z  =  0  and  for  no  other  value  of  z;  or 

(2)  converges  absolutely  for  all  values  of  z;  or 

(3)  converges  absolutely  for  all  values  of  z  within  a  certain  circle  of 
radius  R,  and  does  not  converge  for  any  value  of  z  outside  this 
circle. 

In  case  (3)  the  circle  is  called  the  circle  of  convergence  and  its  radius 
the  radius  of  convergence  of  the  power  series. 

It  should  be  observed  that  this  general  result  gives  absolutely  no  infor¬ 
mation  about  the  behaviour  of  the  series  on  the  circle  of  convergence.  The 
examples  which  follow  show  that  as  a  matter  of  fact  there  are  very  diverse 
possibilities  as  to  this. 


Examples  LXXX.  1.  The  series  1  +  az  +  a2z 2  +  . . . ,  where  a  >  0, 
has  a  radius  of  convergence  equal  to  1/a.  It  does  not  converge  anywhere  on  its 
circle  of  convergence,  diverging  when  z  =  1/a  and  oscillating  finitely  at  all  other 
points  on  the  circle. 


2. 


z  z 2  z3 


The  series  -( — ~  H — ~  +  . . .  has  its  radius  of  convergence  equal  to  1 ; 

lz  2Z  3Z 


it  converges  absolutely  at  all  points  on  its  circle  of  convergence. 
3.  More  generally,  if  |an+i|/|an|  — >  A,  or  \an\l^n  — >  A,  as  n 


oo,  then  the 


series  aQ  +  a\z  +  a2Z2  +  . . .  has  1/A  as  its  radius  of  convergence.  In  the  first  case 


lim  \an+izn+l\/\anzn\  =  A|z|, 


which  is  less  or  greater  than  unity  according  as  \z\  is  less  or  greater  than  1/A, 
so  that  we  can  use  d’Alembert’s  Test  (§  168,  3).  In  the  second  case  we  can  use 
Cauchy’s  Test  (§  168,  2)  similarly. 

4.  The  logarithmic  series.  The  series 
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is  called  (for  reasons  which  will  appear  later)  the  ‘logarithmic’  series.  It  follows 
from  Ex.  3  that  its  radius  of  convergence  is  unity. 

When  2  is  on  the  circle  of  convergence  we  may  write  2  =  cos  9  +  i  sin  9,  and 
the  series  assumes  the  form 

cos  9  —  \  cos  29  +  I  cos  3 9  —  ■  ■  ■  +  i(sin0  —  \  sin  26  +  ^  sin  39  —  ...). 


The  real  and  imaginary  parts  are  both  convergent,  though  not  absolutely 
convergent,  unless  9  is  an  odd  multiple  of  n  (Exs.  lxxix.  3,  4).  If  9  is  an  odd 
multiple  of  ir  then  z  =  —  1,  and  the  series  assumes  the  form  —  1  —  ^  |  —  . . . , 

and  so  diverges  to  — oo.  Thus  the  logarithmic  series  converges  at  all  points  of 
its  circle  of  convergence  except  the  point  z  =  —  1. 

5.  The  binomial  series.  Consider  the  series 


m(m  —  1)  o  m(m  —  l)(m  —  2)  q 
1  +  mz  +  — ^ - -  z^  +  — ^ ^ - ’-zz  +  . . . . 

If  m  is  a  positive  integer  then  the  series  terminates.  In  general 


&71+1 

|&n| 


m  —  n 
n  +  1 


so  that  the  radius  of  convergence  is  unity.  We  shall  not  discuss  here  the  question 
of  its  convergence  on  the  circle,  which  is  a  little  more  difficult.* 


194.  Uniqueness  of  a  power  series.  If  ^an2n  is  a  power  series 
which  is  convergent  for  some  values  of  2  at  any  rate  besides  2  =  0,  and  f(z)  is 
its  sum,  then  it  is  easy  to  see  that  f(z)  can  be  expressed  in  the  form 

0-0  +  0\z  +  0,2  -22  +  •  •  •  +  ( On  +  €z)zn , 


where  ez  — >  0  as  |2|  — >  0.  For  if  /i  is  any  number  less  than  the  radius  of 
convergence  of  the  series,  and  1 2|  <  /r,  then  \an\/in  <  K ,  where  K  is  a  constant 
(cf.  §  192),  and  so 


n 


m  -  E 


-  7  jauz 
o 


S\on+1\\zn+l\  +  \an+2\\zn+2\  + 

n+l 


<  K 


\z\  z 

1  H - 1 - n — h  •  •  • 

H  nz 


K\z\n+1 
-  1 2 1 ) ? 


*See  Bromwich,  Infinite  Series ,  pp.  225  et  seq.;  Hobson,  Plane  Trigonometry 
(3rd  edition),  pp.  268  et  seq. 
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where  K  is  a  number  independent  of  z.  It  follows  from  Ex.  LV.  15  that  if 
Y  CLnzn  =  Y  bnzn  for  all  values  of  2  whose  modulus  is  less  than  some  number  /1, 
then  an  =  bn  for  all  values  of  n.  This  result  is  capable  of  considerable  general¬ 
isations  into  which  we  cannot  enter  now.  It  shows  that  the  same  function  f{z) 
cannot  be  represented  by  two  different  power  series. 


195.  Multiplication  of  Series.  We  saw  in  §  170  that  if  Yun  and 

Y  vn  are  two  convergent  series  of  positive  terms,  then  Yun  x  Yvn  — 

Y  wn,  where 

Wn  =  UqVu  +  U\Vn—i  +  •  •  •  +  unv  o- 

We  can  now  extend  this  result  to  all  cases  in  which  Y  un  and  Y  vn  are 
absolutely  convergent;  for  our  proof  was  merely  a  simple  application  of 
Dirichlet’s  Theorem,  which  we  have  already  extended  to  all  absolutely 
convergent  series. 


Examples  LXXXI.  1.  If  \z\  is  less  than  the  radius  of  convergence  of 
either  of  the  series  Y  anZn,  Y  bnzn,  then  the  product  of  the  two  series  is  Y  cnZn, 
where  cn  =  aobn  +  aibn-i  H - b  a„V 

2.  If  the  radius  of  convergence  of  YanZn  is  R,  and  f(z)  is  the  sum 

of  the  series  when  \z\  <  R,  and  \z\  is  less  than  either  R  or  unity,  then 
f(z)/(  1  -  z)  =  Y  snZn ,  where  sn  =  a0  +  cp  -| - h  an. 

3.  Prove,  by  squaring  the  series  for  1/(1  —  z ),  that  1/(1  —  z)2  =  1  +  2 z  + 
3z2  +  . . .  if  \z\  <  1. 

4.  Prove  similarly  that  1/(1  —  z )3  =  1  +  3 z  +  6 z2  +  . . . ,  the  general  term 
being  \{n  +  l)(n  +  2 )zn. 

5.  The  Binomial  Theorem  for  a  negative  integral  exponent.  If 

1 2: |  <  1,  and  m  is  a  positive  integer,  then 


1 

(1  -  z)m 


1  +  mz  + 


m(m  +  1)  9  m(m  +  1) . . .  (m  +  n  —  1)  „ 

— - -zz  H - 1 - - - - - - - -zn  + 

1-2  1-2  ...n 


[Assume  the  truth  of  the  theorem  for  all  indices  up  to  m.  Then,  by  Ex.  2, 
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1/(1  —  z)m+1  =  Y^snZn,  where 

m(m  +  1)  m(m  +  1) . . .  (m  +  n  —  1) 

Syi  —  1  V  Tfl  T  -  +  •  •  •  +  - 

1-2  1  -2...n 

(m  +  l)(m  +  2) . . .  (m  +  n) 

~  1-2  ...n  ’ 

as  is  easily  proved  by  induction.] 

6.  Prove  by  multiplication  of  series  that  if 

/<»,*) =i+ (7) 

and  |^;|  <  1,  then  f(m,  z)f(m',  z)  =  f(m  +  m ' ,  z).  [This  equation  forms  the  basis 
of  Euler’s  proof  of  the  Binomial  Theorem.  The  coefficient  of  zn  in  the  product 
series  is 


This  is  a  polynomial  in  m  and  w! :  but  when  rn  and  m!  are  positive  integers 

(YYl  TYlf\ 

in  virtue  of  the  Binomial  Theorem 

k  J 

for  a  positive  integral  exponent,  and  if  two  such  polynomials  are  equal  for  all 
positive  integral  values  of  m  and  m!  then  they  must  be  equal  identically.] 

7.  If  f(z)  =  1  +  z  +  —  +  . . .  then  f(z)f(z')  =  f(z  +  z').  [For  the  series 

for  f(z)  is  absolutely  convergent  for  all  values  of  z:  and  it  is  easy  to  see  that  if 

zn  z,n  (z  +  z')n 

un  =  — r,  vn  =  — ,  then  wn  =  - - - . 

n\  n\  n\ 


z2  z4 

C«  =  1-¥+4T- 


3  5 


C(z  +  z')  =  C(z)C(z')  -  S(z)S(z'),  S(z  +  z')  =  S(z)C(z')  +  C(z)S(z'), 


{C(z»2  +  {S(z)}2  =  1. 
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9.  Failure  of  the  Multiplication  Theorem.  That  the  theorem  is  not 
always  true  when  X  un  and  X  vn  are  not  absolutely  convergent  may  be  seen  by 
considering  the  case  in  which 

_  Mr 

—  Vn  —  / - —  • 

Vn  +  1 

Then 

n  1 

^  ^  5  yj(r  +  l)(n  +  l-r) 

But  yf(r  +  l)(n  +  1  —  r)  V  l(n  +  2),  and  so  |tCn|  >  (2n  +  2)/(n  +  2),  which  tends 
to  2;  so  that  X  wn  is  certainly  not  convergent. 

MISCELLANEOUS  EXAMPLES  ON  CHAPTER  VIII. 

1.  Discuss  the  convergence  of  the  series  X  nk{y/n  +  1  —  2 Vn  +  Vn  —  1}, 

where  k  is  real.  (Math.  Trip.  1890.) 

2.  Show  that 

I>rAV), 

where 

Aw?},  =  Un  Un-\- 1?  A  Un  =  A(AUnV 

and  so  on,  is  convergent  if  and  only  iffc  >r  +  s  +  l,  except  when  s  is  a  positive 
integer  less  than  k,  when  every  term  of  the  series  is  zero. 

[The  result  of  Ch.  VII,  Misc.  Ex.  11,  shows  that  A fc(ns)  is  in  general  of 
order  ns~k.} 

3.  Show  that 

CC  n2  +  9n  +  5  5 

^  (n  +  l)(2n  +  3)(2n  +  5)(n  +  4)  “  36' 

(Math.  Trip.  1912.) 

[Resolve  the  general  term  into  partial  fractions.] 

4.  Show  that,  if  R(n )  is  any  rational  function  of  n,  we  can  determine 
a  polynomial  P(n)  and  a  constant  A  such  that  XI^(n)  —  P(n)  —  (A/n)}  is 
convergent.  Consider  in  particular  the  cases  in  which  R(n )  is  one  of  the  functions 
l/(an  +  b),  (an2  +  2 bn  +  c)/(an 2  +  2 f3n  +  7). 
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5.  Show  that  the  series 

11111 

1 - 1 - 1 - 1-  •  •  • 

1  +  z  2  2  +  z  3  3  +  z 

is  convergent  provided  only  that  z  is  not  a  negative  integer. 

6.  Investigate  the  convergence  or  divergence  of  the  series 

Vsin-,  ylsin-,  V^sin-, 

^(1_C0S3’ 

where  a  is  real. 

7.  Discuss  the  convergence  of  the  series 


E 


i 


1  1 

1+2+3+ 


1  \  sin(n#  +  a) 
n )  n 


where  9  and  a  are  real. 

8.  Prove  that  the  series 


(Math.  Trip.  1989.) 


1 


l-l+l+l+l-l 

2  3^4^5^6  7 


10 


+  •  •  • 


in  which  successive  terms  of  the  same  sign  form  groups  of  1,  2,  3,  4,  . . .  terms, 
is  convergent;  but  that  the  corresponding  series  in  which  the  groups  contain  1, 
2,  4,  8,  ...  terms  oscillates  finitely.  (Math.  Trip.  1908.) 

9.  If  ui,  U2,  us,  ...  is  a  decreasing  sequence  of  positive  numbers  whose 
limit  is  zero,  then  the  series 


u\  -  \(ui+u2)  +  l(ui  +  U2  +  U3)  -  . . . ,  l(ui+u3)  +  \(u\  +U3  +  U5)  -  . . . 

are  convergent.  [For  if  (u±  +  u2  +  •  •  •  +  un)/n  =  vn  then  v\,  v2,  v3,  . . .  is  also  a 
decreasing  sequence  whose  limit  is  zero  (Ch.  IV,  Misc.  Exs.  8,  27).  This  shows 
that  the  first  series  is  convergent;  the  second  we  leave  to  the  reader.  In  particular 
the  series 


1  _  5  i1  +  5)  +  3  i1  +  \  +  g)  ~  •  •  •  ’ 


i-Hi  +  £)  +  Hi  +  !  +  S)-- 


are  convergent.] 
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10.  If  uq  +  u\  +  U2  +  •  •  •  is  a  divergent  series  of  positive  and  decreasing  terms, 
then 

(uo  +  U2  H - b  U2n)/ (ui  +  U3  H - b  U2n+l)  1- 


11.  Prove  that  if  a  >  0  then  lim  V  (p  +  n)  1  “  =  0. 

P^OO  n—Q 
OO 

12.  Prove  that  lim  Qy'V1^“  =  l.  [It  follows  from  §  174  that 

a-s>0+  i 


0  <  l"1_a  +  2~1~a  +  ■■■  +  («-  l)"1"0 


w —  1 — Q: 


dx  <  1, 


and  it  is  easy  to  deduce  that  1  “  lies  between  1/a  and  (1/a)  +  1.] 

OO 

13.  Find  the  sum  of  the  series  s^Juni  where 

i 


x  —  x 


—n—  1 


1 


Un  = 


(. xn  +  x~n){xn+l  +  x~n~l)  x  —  1  \Xn  +  X 


Vn+1  +  : 


.—n—l 


for  all  real  values  of  x  for  which  the  series  is  convergent.  (Math.  Trip.  1901.) 

[If  |x|  is  not  equal  to  unity  then  the  series  has  the  sum  x/{(x  —  l)(.x2  +  1)}. 
If  x  =  1  then  un  =  0  and  the  sum  is  0.  If  x  =  —  1  then  un  =  ^(— l)n+1  and  the 
series  oscillates  finitely.] 

14.  Find  the  sums  of  the  series 


+ 


2V 


1  +  z  1  +  z“ 


+ 


4z4 


1  + V 


+  . . . , 


+ 


1-z2  1-z 


+ 


1  -  zl 


+  ... 


(in  which  all  the  indices  are  powers  of  2),  whenever  they  are  convergent. 

[The  first  series  converges  only  if  \z\  <  1,  its  sum  then  being  z/(  1  —  z);  the 
second  series  converges  to  z/(  1  —  z)  if  \z\  <  1  and  to  1/(1  —  z)  if  |^|  >  1.] 

15.  If  |on|  =  1  f°r  all  values  of  n  then  the  equation 


0  —  1  ~b  cl\z  ~b  a2z  +  . . . 


cannot  have  a  root  whose  modulus  is  less  than  ^ ,  and  the  only  case  in  which  it 
can  have  a  root  whose  modulus  is  equal  to  \  is  that  in  which  an  =  —  Cis(n0), 
when  z  =  \  Cis(— 6)  is  a  root. 
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16.  Recurring  Series.  A  power  series  ^  anzn  is  said  to  be  a  recurring 
series  if  its  coefficients  satisfy  a  relation  of  the  type 

V  Pl^n—l  T  P2®n— 2  T  '  '  '  T  Pk^n—k  —  0;  (1) 

where  n  ^  k  and  p\ ,  p2,  ■  ■  ■ ,  Pk  are  independent  of  n.  Any  recurring  series  is 
the  expansion  of  a  rational  function  of  2.  To  prove  this  we  observe  in  the  first 
place  that  the  series  is  certainly  convergent  for  values  of  2  whose  modulus  is 
sufficiently  small.  For  let  G  be  the  greater  of  the  two  numbers 

1,  M  +  N  h - f  \pk\- 

Then  it  follows  from  the  equation  (1)  that  |an|  =  Gan,  where  an  is  the  modulus 
of  the  numerically  greatest  of  the  preceding  coefficients;  and  from  this  that 
|an|  <  KGn,  where  K  is  independent  of  n.  Thus  the  recurring  series  is  certainly 
convergent  for  values  of  2  whose  modulus  is  less  than  1/G. 

But  if  we  multiply  the  series  f(z )  =  anzn  by  p\z,  P2Z 2,  . . . ,  pkZk ,  and  add 
the  results,  we  obtain  a  new  series  in  which  all  the  coefficients  after  the  (k  —  l)th 
vanish  in  virtue  of  the  relation  (1),  so  that 

(1  +  piz  +  p2z 2  H - b  pkzk)f(z )  =  P0  +  Piz  4 - f  Pfc_i2fc_1, 

where  Po,  P\,  . . . ,  Pk-i  are  constants.  The  polynomial  l+piz+p2z2  +  •  •  • +PkZk 
is  called  the  scale  of  relation  of  the  series. 

Conversely,  it  follows  from  the  known  results  as  to  the  expression  of  any 
rational  function  as  the  sum  of  a  polynomial  and  certain  partial  fractions  of 
the  type  A/(z  —  a)p,  and  from  the  Binomial  Theorem  for  a  negative  integral 
exponent,  that  any  rational  function  whose  denominator  is  not  divisible  by  2 
can  be  expanded  in  a  power  series  convergent  for  values  of  2  whose  modulus  is 
sufficiently  small,  in  fact  if  |2 1  <  p,  where  p  is  the  least  of  the  moduli  of  the 
roots  of  the  denominator  (cf.  Ch.  IV,  Misc.  Exs.  18  et  seq.).  And  it  is  easy  to 
see,  by  reversing  the  argument  above,  that  the  series  is  a  recurring  series.  Thus 
the  necessary  and  sufficient  condition  that  a  power  series  should  be  a  recurring 
series  is  that  it  shoidd  be  the  expansion  of  such  a  rational  function  of  z. 

17.  Solution  of  Difference-Equations.  A  relation  of  the  type  of  (1)  in 
Ex.  16  is  called  a  linear  difference-equation  in  an  with  constant  coefficients.  Such 
equations  may  be  solved  by  a  method  which  will  be  sufficiently  explained  by  an 
example.  Suppose  that  the  equation  is 

ttn— 1  8U77— 2  V  12a„._3  =  0. 
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Consider  the  recurring  power  series  'Y^/anzn.  We  find,  as  in  Ex.  16,  that  its  sum 
is 

a0  +  (ai  -  a0)z  +  (a2  —  a\  -  8a0)z2  _  Ai  A2  B 

1  -  z  -  8z2  +  12z3  “  1  -  2z  +  (1  -  2 z)2  +  TT~3z’ 

where  A\ ,  A2,  and  B  are  numbers  easily  expressible  in  terms  of  ao,  ai,  and  a2. 
Expanding  each  fraction  separately  we  see  that  the  coefficient  of  zn  is 

an  =  2rL{A\  +  (n  +  1)A2}  +  (—3  )nB. 

The  values  of  Ai,  A2,  B  depend  upon  the  first  three  coefficients  ao,  ai,  a2,  which 
may  of  course  be  chosen  arbitrarily. 

18.  The  solution  of  the  difference-equation  un  —  2cos6htn_i  +  un-2  =  0  is 
un  =  A  cos  n9  +  B  sin  nO,  where  A  and  B  are  arbitrary  constants. 

19.  If  un  is  a  polynomial  in  n  of  degree  k,  then  ^  unzn  is  a  recurring  series 

whose  scale  of  relation  is  (1  —  z)k+1 .  (Math.  Trip.  1904.) 

20.  Expand  9 /{(z  —  1  )(z  +  2)2}  in  ascending  powers  of  z. 

(Math.  Trip.  1913.) 

21.  Prove  that  if  f(n)  is  the  coefficient  of  zn  in  the  expansion  of  z/ (1+z+z2) 
in  powers  of  z,  then 

(1)  f(n)  +  f(n  -  1)  +  f(n  -  2)  =  0,  (2)  f(n)  =  (c*$  -  wln)/(w3  -  w|), 

where  w3  is  a  complex  cube  root  of  unity.  Deduce  that  f(n )  is  equal  to  0  or  1 
or  —1  according  as  n  is  of  the  form  3k  or  3k  +  1  or  3k  +  2,  and  verify  this  by 

means  of  the  identity  z/(l  +  z  +  z2)  =  z(  1  —  z)/(  1  —  z3). 

22.  A  player  tossing  a  coin  is  to  score  one  point  for  every  head  he  turns 
up  and  two  for  every  tail,  and  is  to  play  on  until  his  score  reaches  or  passes  a 
total  n.  Show  that  his  chance  of  making  exactly  the  total  n  is  |{2  +  (— ^)n}. 

(Math.  Trip.  1898.) 

[If  pn  is  the  probability  then  pn  =  \(jpn- 1  +  pn- 2)-  Also  p0  =  1,  pi  =  \] 

23.  Prove  that 

11  1  _  fn\  1  fn\  1! 

a+l~l~a  +  2~l~  ~*~a  +  n  \l/a  +  l  \2y  (a  +  l)(a  +  2)  ~*~ 

if  n  is  a  positive  integer  and  a  is  not  one  of  the  numbers  —1,  —2,  . . . ,  —  n. 
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[This  follows  from  splitting  up  each  term  on  the  right-hand  side  into  partial 
fractions.  When  a  >  —1,  the  result  may  be  deduced  very  simply  from  the 
equation 


A-xn 
1  —  x 


dx 


(1 -*)“{!-(! 


*n 


dx 

x 


by  expanding  (1  —  xn)/(l  —  x)  and  1  —  (1  —  x)n  in  powers  of  x  and  integrating 
each  term  separately.  The  result,  being  merely  an  algebraical  identity,  must  be 
true  for  all  values  of  a  save  —  1,  —2,  . . . ,  —  n.] 

24.  Prove  by  multiplication  of  series  that 


'4—'  n!  "4—/  n  .  n\ 
o  1 


[The  coefficient  of  zn  will  be  found  to  be 


Now  use  Ex.  23,  taking  a  =  0.] 

25.  If  An  — >  A  and  Bn  — >  B  as  n  — >  oo,  then 


( A\Bn  +  A2Bn-\  +  •  •  •  +  AnBi)/n  — >  AB. 

[Let  An  =  A  +  en.  Then  the  expression  given  is  equal  to 

^B\  +  I?2  +  •  •  •  +  Bn  e\Bn  +  £2Bn—i  +  •  •  •  +  enB\ 
n  n 

The  hrst  term  tends  to  AB  (Ch.  IV,  Misc.  Ex.  27).  The  modulus  of  the 
second  is  less  than  /?{|ei|  +  |  €2 1  +  •  •  •  +  |en|}/n,  where  /3  is  any  number  greater 
than  the  greatest  value  of  \BU\:  and  this  expression  tends  to  zero.] 

26.  Prove  that  if  cn  =  a\bn  +  a2bn-\  +  •  •  •  +  anb\  and 

An  =  ai  +  <22  +  •  •  •  +  an,  Bn  =  b\  +  62  +  ■  ■  ■  +  bn,  Cn  =  ci  +  C2  +  •  •  •  +  cn, 
then 


Cn  —  CL\Bn  +  a,2Bn—\  +  •  •  •  +  anB  1  —  b\An  +  b2An— 1  +  •  •  •  +  bnAi 
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and 

C\  +  C‘2  +  •  •  •  +  Cn  =  A\Bn  +  A2Bn_i  +  •  •  •  +  AnB\ . 

Hence  prove  that  if  the  series  ^  an,  ^  are  convergent  and  have  the  sums 
A,  B,  so  that  An  — >•  A,  Bn  — >  B ,  then 


(Cl  +  C'2  +  •  •  •  +  Cn) / 77.  — >  AB. 


Deduce  that  if  ^  cn  is  convergent  then  its  sum  is  AB.  This  result  is  known 
as  Abel’s  Theorem  on  the  multiplication  of  Series.  We  have  already 
seen  that  we  can  multiply  the  series  ]> ~fan ,  in  this  way  if  both  series  are 

absolutely  convergent:  Abel’s  Theorem  shows  that  we  can  do  so  even  if  one 
or  both  are  not  absolutely  convergent,  provided  only  that  the  product  series  is 
convergent. 

27.  Prove  that 


i  (i  - 1  + 1  -  •  -  -  )2  =  I  -  Hi  + 1)  +  Mi  + 1  + 1)  -  •  •  •  • 

[Use  Ex.  9  to  establish  the  convergence  of  the  series.] 

/*7T 

28.  For  what  values  of  rn  and  n  is  the  integral  /  sinm  x(l  —  cos  x)n  dx 

Jo 

convergent?  [If  m  +  1  and  m  +  2n  +  1  are  positive.] 

29.  Prove  that  if  a  >  1  then 

f1  dx  7 r 

7-i  (a  —  x)y/l  —  x2  \J a2  —  1 


30.  Establish  the  formulae 


roc  _  roc  /  1  \ 

I  F{  \fx 2  +  1  +  x}  dx  =  \  J  (l  +  ^)F(y)dy, 
j  F{  \Jx2  +  1  -  x}  dx  =  \  J  (^l  +  F(y)  dy. 


In  particular,  prove  that  if  n  >  1  then 

r°°  dx 


1 0  {  Vx2  +  1  +  x}r 


roo 

/  { 'sj x2  T  1  —  x}n  dx  = 

Jo 


n 


n2  —  1 
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[In  this  and  the  succeeding  examples  it  is  of  course  supposed  that  the  arbi¬ 
trary  functions  which  occur  are  such  that  the  integrals  considered  have  a  meaning 
in  accordance  with  the  definitions  of  §§  177  et  seq.\ 

31.  Show  that  if  2 y  =  ax  —  (b/x),  where  a  and  b  are  positive,  then  y  increases 
steadily  from  — oo  to  oo  as  x  increases  from  0  to  oo.  Hence  show  that 


-  [  f{  Vv2  +  ab}  { 1  +  ^  1  dy 

a  J- oo  [  \Jy2  +  ab  J 

-  /  /{  vV  +  ob}  dy. 

a  Jo 


32.  Show  that  if  2 y  =  ax  +  (b/x),  where  a  and  b  are  positive,  then  two  values 
of  x  correspond  to  any  value  of  y  greater  than  \fab.  Denoting  the  greater  of 
these  by  x\  and  the  less  by  X2,  show  that,  as  y  increases  from  y/ab  towards  oo, 
x\  increases  from  yjb/a  towards  oo,  and  X2  decreases  from  yfbja  to  0.  Hence 
show  that 


y 


dXl  =  -  r  f(y)  J  _ 

a  J  Vab  )  yjy1  —  ab 


r  OO 

lf(y) 

J  y/b/a 

r  \fbja  i  ro o 

L  mdX2  =  °LmWy^i 


y 


+  dy, 
-  1  i  dy, 


and  that 


rf{?{ax+i)}dx=ui 


y/y2  —  ab  a  Jo 


33.  Prove  the  formula 


dx 


/  /(sec  ^ x  +  tan  ^x)  - 
I  o  V  sin  x 


r  t<  \  dx 

/  t (cosec x)  , 

1 0  Vsinx 


34.  If  a  and  b  are  positive,  then 


dx 

(. x 2  +  a2)(x2  +  b2) 


7 r 

2 ab(a  +  b)  ’ 


x2  dx 

( x 2  +  a2)(x2  +  b2) 


7 r 

2(a  +  b) 
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Deduce  that  if  a,  /3 ,  and  7  are  positive,  and  f32  ^  cry,  then 
dx  it  x2  dx 


f 

Jo 


ax 4  +  2  f3x2  +  7  2  \J2^A 


•  f 


7T 


ax4  +  2(3x2  +  7  2y/2aA' 


where  A  =  [3  +  y/ar).  Also  deduce  the  last  result  from  Ex.  31,  by  putting 
f(y)  =  l/(c2  +  y2).  The  last  two  results  remain  true  when  f3 2  <  cry,  but  their 
proof  is  then  not  quite  so  simple. 

35.  Prove  that  if  b  is  positive  then 


f 


x  2  dx 


TT 


■  L 


x4  dx 


TT 


o  (x2  —  a2)2  +  b2x2  2b'  J0  {(x2  —  a2)2  +  b2  x2}2  4  b3 

36.  Extend  Schwarz’s  inequality  (Ch.  VII,  Misc.  Ex.  42)  to  infinite  integrals 
of  the  first  and  second  kinds. 

37.  Prove  that  if  4>(x)  is  the  function  considered  at  the  end  of  §  178  then 


poo  00 

/  4>{x)  dx  =  y2 

Jo 


1 


(n  +  l)s 


38.  Prove  that 


s; 

ro o 


dx 


x  -  y  ,  1  _  , 

(x  +  .,/)••*  l  lJ  * 


dy 


^3  dx  )  =  1; 
(x  +  y)3  1 


dx 


r°c  x2  _  y2 

£  (x2  +  y2)2 


dv)  =  -^,  l  Ay  (l 


00  x2  —  y2 


(x2  +  y2)2  dX  ) 


Establish  similar  results  in  which  the  limits  of  integration  are  0  and  1. 

(Math.  Trip.  1913.) 


CHAPTER  IX 


THE  LOGARITHMIC  AND  EXPONENTIAL  FUNCTIONS 
OF  A  REAL  VARIABLE 

196.  The  number  of  essentially  different  types  of  functions  with  which 
we  have  been  concerned  in  the  foregoing  chapters  is  not  very  large.  Among 
those  which  have  occurred  the  most  important  for  ordinary  purposes  are 
polynomials,  rational  functions,  algebraical  functions,  explicit  or  implicit, 
and  trigonometrical  functions,  direct  or  inverse. 

We  are  however  far  from  having  exhausted  the  list  of  functions  which 
are  important  in  mathematics.  The  gradual  expansion  of  the  range  of 
mathematical  knowledge  has  been  accompanied  by  the  introduction  into 
analysis  of  one  new  class  of  function  after  another.  These  new  functions 
have  generally  been  introduced  because  it  appeared  that  some  problem 
which  was  occupying  the  attention  of  mathematicians  was  incapable  of 
solution  by  means  of  the  functions  already  known.  The  process  may  fairly 
be  compared  with  that  by  which  the  irrational  and  complex  numbers  were 
first  introduced,  when  it  was  found  that  certain  algebraical  equations  could 
not  be  solved  by  means  of  the  numbers  already  recognised.  One  of  the 
most  fruitful  sources  of  new  functions  has  been  the  problem  of  integration. 
Attempts  have  been  made  to  integrate  some  function  f(x)  in  terms  of 
functions  already  known.  These  attempts  have  failed;  and  after  a  certain 
number  of  failures  it  has  begun  to  appear  probable  that  the  problem  is 
insoluble.  Sometimes  it  has  been  proved  that  this  is  so;  but  as  a  rule  such 
a  strict  proof  has  not  been  forthcoming  until  later  on.  Generally  it  has 
happened  that  mathematicians  have  taken  the  impossibility  for  granted  as 
soon  as  they  have  become  reasonably  convinced  of  it,  and  have  introduced 
a  new  function  F{x)  defined  by  its  possessing  the  required  property,  viz. 
that  F'{x)  =  f(x).  Starting  from  this  definition,  they  have  investigated  the 
properties  of  F(x)]  and  it  has  then  appeared  that  F{x)  has  properties  which 
no  finite  combination  of  the  functions  previously  known  could  possibly 
have;  and  thus  the  correctness  of  the  assumption  that  the  original  problem 
could  not  possibly  be  solved  has  been  established.  One  such  case  occurred 
in  the  preceding  pages,  when  in  Ch.  VI  we  defined  the  function  logo;  by 
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means  of  the  equation 


log  a; 


/ 


dx 


x 


Let  us  consider  what  grounds  we  have  for  supposing  logx  to  be  a  really 
new  function.  We  have  seen  already  (Ex.  XLII.  4)  that  it  cannot  be  a  rational 
function,  since  the  derivative  of  a  rational  function  is  a  rational  function  whose 
denominator  contains  only  repeated  factors.  The  question  whether  it  can  be  an 
algebraical  or  trigonometrical  function  is  more  difficult.  But  it  is  very  easy  to 
become  convinced  by  a  few  experiments  that  differentiation  will  never  get  rid 
of  algebraical  irrationalities.  For  example,  the  result  of  differentiating  y/1  +  x 
any  number  of  times  is  always  the  product  of  \/l  +  x  by  a  rational  function, 
and  so  generally.  The  reader  should  test  the  correctness  of  the  statement  by 
experimenting  with  a  number  of  examples.  Similarly,  if  we  differentiate  a  func¬ 
tion  which  involves  sinx  or  cosx,  one  or  other  of  these  functions  persists  in  the 
result. 

We  have,  therefore,  not  indeed  a  strict  proof  that  logx  is  a  new  function — 
that  we  do  not  profess  to  give* — but  a  reasonable  presumption  that  it  is.  We 
shall  therefore  treat  it  as  such,  and  we  shall  find  on  examination  that  its  prop¬ 
erties  are  quite  unlike  those  of  any  function  which  we  have  as  yet  encountered. 

197.  Definition  of  log  a;.  We  define  log  a;,  the  logarithm  of  x,  by  the 
equation 


We  must  suppose  that  x  is  positive,  since  (Ex.  LXXVI.  2)  the  integral  has 
no  meaning  if  the  range  of  integration  includes  the  point  x  —  0.  We 
might  have  chosen  a  lower  limit  other  than  1;  but  1  proves  to  be  the  most 
convenient.  With  this  definition  logl  =  0. 

We  shall  now  consider  how  log  a;  behaves  as  x  varies  from  0  towards  oo. 
It  follows  at  once  from  the  definition  that  log  a:  is  a  continuous  function 
of  x  which  increases  steadily  with  x  and  has  a  derivative 


Dx  log  x  =  1/x; 


For  such  a  proof  see  the  author’s  tract  quoted  on  p.  286. 
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and  it  follows  from  §  175  that  logx  tends  to  oo  as  x  — »  oo. 

If  x  is  positive  but  less  than  1,  then  logx  is  negative.  For 


logx 


<  0. 


Moreover,  if  we  make  the  substitution  t  —  1/u  in  the  integral,  we  obtain 


logx 


1/xdu 

u 


l°g(l/ x)- 


Thus  logo:  tends  steadily  to  —  oo  as  x  decreases  from  1  to  0. 

The  general  form  of  the  graph  of  the  logarithmic  function  is  shown  in 
Fig.  52.  Since  the  derivative  of  logo:  is  1/x,  the  slope  of  the  curve  is  very 


gentle  when  x  is  very  large,  and  very  steep  when  x  is  very  small. 


Examples  LXXXII.  1.  Prove  from  the  definition  that  if  u  >  0  then 
u/ (1  +  it)  <  log(l  +  u)  <u. 


[For  log(l  +  tt) 

1/(1  +  «).] 


dt 

1  +  t’ 


o 


and  the  subject  of  integration  lies  between  1  and 
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V?1  v? 

2.  Prove  that  log(l  +  u)  lies  between  u  — —  and  u  —  — — - - 

2  2  ( 1  uj 

fu  t  dt 

positive.  [Use  the  fact  that  log(l  +  u)  =  u  —  J  ^  +  ^.] 

3.  If  0  <  u  <  1  then  u  <  —  log(l  —  u)  <  u/(l  —  u). 

4.  Prove  that 


when  u  is 


log  x  log(l  +  t) 

Inn -  =  hm -  =  1. 

I->1  x  —  1  t->  o  t, 


[Use  Ex.  1.] 


198.  The  functional  equation  satisfied  by  logo:.  The  function 
log  x  satisfies  the  functional  equation 


f{xy)  =  f{x)  +  f(y). 

For,  making  the  substitution  t  =  yu ,  we  see  that 


rxv  dt 


du 


du 


du 


log  XV  —  -=/--/  - 


t 


1  i/y 


u 


u 


u 


=  log  a;  —  log(l  /  a/)  =  logo;  +  logy, 
which  proves  the  theorem. 


(1) 


Examples  LXXXIII.  1.  It  can  be  shown  that  there  is  no  solution  of 
the  equation  (1)  which  possesses  a  differential  coefficient  and  is  fundamentally 
distinct  from  log  x.  For  when  we  differentiate  the  functional  equation,  first  with 
respect  to  x  and  then  with  respect  to  y,  we  obtain  the  two  equations 

yf'{xy )  =  /'( x),  xf'{xy )  =  f(y); 


and  so,  eliminating  f  (xy) ,  xf'{x)  =  yf'{y)-  But  if  this  is  true  for  every  pair  of 
values  of  x  and  y ,  then  we  must  have  xf'(x)  =  C ,  or  f'(x)  =  C/x,  where  C  is  a 
constant.  Hence 


f(x) 


- dx  +  C'  =  Clogx  +  C', 
x 


and  it  is  easy  to  see  that  C'  =  0.  Thus  there  is  no  solution  fundamentally  distinct 
from  log  x,  except  the  trivial  solution  f(x)  =  0,  obtained  by  taking  (7  =  0. 
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2.  Show  in  the  same  way  that  there  is  no  solution  of  the  equation 

f(x)  +  f(y)  =  f(^L) 

V1  -  xvJ 

which  possesses  a  differential  coefficient  and  is  fundamentally  distinct  from 
arc  tan  x. 


199.  The  manner  in  which  logx  tends  to  infinity  with  x.  It  will 

be  remembered  that  in  Ex.  XXXVI.  6  we  defined  certain  different  ways  in 
which  a  function  of  x  may  tend  to  infinity  with  x ,  distinguishing  between 
functions  which,  when  x  is  large,  are  of  the  first,  second,  third,  . . .  orders 
of  greatness.  A  function  f(x)  was  said  to  be  of  the  kth  order  of  greatness 
when  /  ( x )  / xk  tends  to  a  limit  different  from  zero  as  x  tends  to  infinity. 

It  is  easy  to  define  a  whole  series  of  functions  which  tend  to  infinity 
with  x ,  but  whose  order  of  greatness  is  smaller  than  the  first.  Thus  y/x, 
<yx,  -\/x,  . . .  are  such  functions.  We  may  say  generally  that  xa,  where  a  is 
any  positive  rational  number,  is  of  the  atli  order  of  greatness  when  x  is 
large.  We  may  suppose  a  as  small  as  we  please,  e.g.  less  than  .000  0001. 
And  it  might  be  thought  that  by  giving  a  all  possible  values  we  should 
exhaust  the  possible  ‘orders  of  infinity’  of  f(x).  At  any  rate  it  might  be 
supposed  that  if  f(x)  tends  to  infinity  with  x,  however  slowly,  we  could 
always  find  a  value  of  a  so  small  that  xa  would  tend  to  infinity  more  slowly 
still;  and,  conversely,  that  if  f(x)  tends  to  infinity  with  x,  however  rapidly, 
we  could  always  find  a  value  of  a  so  great  that  xa  would  tend  to  infinity 
more  rapidly  still. 

Perhaps  the  most  interesting  feature  of  the  function  logo;  is  its  be¬ 
haviour  as  x  tends  to  infinity.  It  shows  that  the  presupposition  stated 
above,  which  seems  so  natural,  is  unfounded.  The  logarithm  of  x  tends  to 
infinity  with  x,  but  more  slowly  than  any  positive  power  of  x,  integral  or 
fractional.  In  other  words  logx  -A  oo  but 


logo: 

,Ta 


0 


for  all  positive  values  of  a.  This  fact  is  sometimes  expressed  loosely  by  say¬ 
ing  that  the  ‘order  of  infinity  of  logx  is  infinitely  small’;  but  the  reader  will 
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hardly  require  at  this  stage  to  be  warned  against  such  modes  of  expression. 


200.  Proof  that  (loga;)/a;a  — >  0  as  x  — >  oo.  Let  P  be  any  positive 
number.  Then  1/t  <  l/f1-'3  when  t  >  1,  and  so 


logo: 


dt 


or 

log  a;  <  (a/3  —  1  )//3  <  x^/P, 

when  x  >  1.  Now  if  a  is  any  positive  number  we  can  choose  a  smaller 
positive  value  of  ft.  And  then 


0  <  (logx)/xa  <  x13  “ /P  (x  >  1). 


But,  since  a  >  P,  x13  a/ P  — >  0  as  x  — »  oo,  and  therefore 


(\ogx)/xa  -A  0. 


201.  The  behaviour  of  logo;  as  x  — >  +0.  Since 

(\ogx)/xa  =  -y°  log  y 

if  x  =  l/y.  it  follows  from  the  theorem  proved  above  that 
lim  yalogy  =  —  lim  (log  x) /xa  =  0. 

y— >-+0  x— >-+oo 


Thus  logo;  tends  to  — oo  and  log(l/a;)  =  —  logo:  to  oo  as  a;  tends  to  zero 
by  positive  values,  but  log(l/a;)  tends  to  oo  more  slowly  than  any  positive 
power  of  1/a;,  integral  or  fractional. 
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202.  Scales  of  infinity.  The  logarithmic  scale.  Let  us  consider 
once  more  the  series  of  functions 

x,  y/x,  yfx,  . . . ,  yfx ,  . . . , 

which  possesses  the  property  that,  if  /(x)  and  </>(x)  are  any  two  of  the  functions 
contained  in  it,  then  f(x)  and  0(x)  both  tend  to  oo  as  x  — >  oo,  while  f{x)/(j){x) 
tends  to  0  or  to  oo  according  as  f(x)  occurs  to  the  right  or  the  left  of  < f>(x)  in 
the  series.  We  can  now  continue  this  series  by  the  insertion  of  new  terms  to  the 
right  of  all  those  already  written  down.  We  can  begin  with  logx,  which  tends  to 
infinity  more  slowly  than  any  of  the  old  terms.  Then  flog  x  tends  to  oo  more 
slowly  than  logx,  flogx  than  \/log x,  and  so  on.  Thus  we  obtain  a  series 

X,  y/x,  yfx,  ...,  yfx,  ...  logX,  y/logX,  y/logX,  ...  y/logX,  ... 

formed  of  two  simply  infinite  series  arranged  one  after  the  other.  But  this  is  not 
all.  Consider  the  function  log  logx,  the  logarithm  of  logx.  Since  (logx)/xQ  — >  0, 
for  all  positive  values  of  a,  it  follows  on  putting  x  =  logy  that 

(log log y)/ (log y)a  =  (logx)/xQ  0. 

Thus  log  logy  tends  to  oo  with  y,  but  more  slowly  than  any  power  of  logy. 
Hence  we  may  continue  our  series  in  the  form 

X,  y/x,  yfx,  ...  logx,  y/\ogX,  y/logX,  ... 
log  log  x,  y/log  logx,  . . .  v/Ioglogx,  . . . ; 

and  it  will  by  now  be  obvious  that  by  introducing  the  functions  log  log  logx, 
log  log  log  log  x,  ...  we  can  prolong  the  series  to  any  extent  we  like.  By  putting 
x  =  1/y  we  obtain  a  similar  scale  of  infinity  for  functions  of  y  which  tend  to  oo 
as  y  tends  to  0  by  positive  values.* 

Examples  LXXXIV.  1.  Between  any  two  terms  /(x),  F(x)  of  the 
series  we  can  insert  a  new  term  (f>(x)  such  that  4>(x)  tends  to  oo  more  slowly 
than  /(x)  and  more  rapidly  than  F{x).  [Thus  between  y/x  and  yfx  we  could 

*For  fuller  information  as  to  ‘scales  of  infinity’  see  the  author’s  tract  ‘Orders  of 
Infinity’,  Carnb.  Math.  Tracts ,  No.  12. 
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insert  x5/12:  between  y/logx  and  //logx  we  could  insert  (logx)5/12.  And,  gen¬ 
erally,  <f>(x)  =  \J f(x)F(x )  satisfies  the  conditions  stated.] 

2.  Find  a  function  which  tends  to  oo  more  slowly  than  y/x,  but  more  rapidly 
than  x“,  where  a  is  any  rational  number  less  than  1/2.  [y/x/ (logx)  is  such  a 
function;  or  y/x/ (logx)^ ,  where  f5  is  any  positive  rational  number.] 

3.  Find  a  function  which  tends  to  oo  more  slowly  than  y/x,  but  more  rapidly 
than  y/x/(logx)“,  where  a  is  any  rational  number.  [The  function  y/xj (log  logx) 
is  such  a  function.  It  will  be  gathered  from  these  examples  that  incompleteness 
is  an  inherent  characteristic  of  the  logarithmic  scale  of  infinity.] 

4.  How  does  the  function 

/(x)  =  {xa(logx)a  (loglogx)“  } / {x^ ilogx)13  (loglogx)^  } 
behave  as  x  tends  to  oo?  [If  a  /  j3  then  the  behaviour  of 
/(x)  =  xa~l3(logx)a  (loglogx)“ 

is  dominated  by  that  of  xa~P.  If  a  =  (3  then  the  power  of  x  disappears  and  the 
behaviour  of  f(x)  is  dominated  by  that  of  (logx)a  ~3  ,  unless  a!  =  /?',  when  it 
is  dominated  by  that  of  (loglogx)“  ~'3  .  Thus  f(x)  — >  oo  if  a  >  (5,  or  a  =  (3, 
a'  >  /3' ,  or  a  =  (5,  a'  =  f3' ,  a"  >  f3" ,  and  f(x)  — >  0  if  a  <  /?,  or  a  =  j3,  a'  <  f3' , 
or  a  =  /3,  a'  =  /3' ,  a "  <  /3".\ 

5.  Arrange  the  functions  x/y/Togx ,  xydoga:/ loglog.x,  x\og\ogx/ y/\ogx, 
(x  log  log  log  x)  /  \/log  log  x  according  to  the  rapidity  with  which  they  tend  to 
infinity  as  x  — >  oo. 

6.  Arrange 

loglogx/(xlogx),  (logx)/x,  x  log  log  x/  \J  x1  +  1,  {y/x  +  l}/x(logx)2 

according  to  the  rapidity  with  which  they  tend  to  zero  as  x  — >  oo. 

7.  Arrange 

xloglog(l/x),  yfx / {log(l^ / ^x)} ,  -\/xsinxlog(l/x),  (1  —  cosx)  log(l/x) 

according  to  the  rapidity  with  which  they  tend  to  zero  as  x  — >  +0. 

8.  Show  that 

Dx  log  log  x  =  1/ (x  log  x) ,  Dx  log  log  log  x  =  1/ (x  log  x  log  log  x) , 
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and  so  on. 

9.  Show  that 

Ac  (log  x)a  =  a/{x{\ ogx)1'"},  Dx(  log  logic)"  =  a/{xlogx(logloga;)1_"}, 
and  so  on. 

203.  The  number  e.  We  shall  now  introduce  a  number,  usually 
denoted  by  e,  which  is  of  immense  importance  in  higher  mathematics.  It 
is,  like  7T,  one  of  the  fundamental  constants  of  analysis. 

We  define  e  as  the  number  whose  logarithm  is  1.  In  other  words  e  is 
defined  by  the  equation 


Since  logx  is  an  increasing  function  of  x ,  in  the  stricter  sense  of  §  95,  it 
can  only  pass  once  through  the  value  1.  Hence  our  definition  does  in  fact 
define  one  definite  number. 

Now  log  xy  =  log  x  +  log  y  and  so 

logic2  =  2 logo:,  logic3  =  3 log#,  ...,  logic”  =  n log x, 

where  n  is  any  positive  integer.  Hence 

log  en  =  n  log  e  =  n. 

Again,  if  p  and  q  are  any  positive  integers,  and  ep'q  denotes  the  positive 
qth  root  of  ep,  we  have 

p  =  logep  =  log(ep^q)q  =  q  logep^q, 

so  that  \ogep!q  =  p/q.  Thus,  if  y  has  any  positive  rational  value,  and 
ey  denotes  the  positive  yth  power  of  e,  we  have 

log  ey  =  y,  (1) 

and  \oge~y  =  —  logey  =  —y.  Hence  the  equation  (1)  is  true  for  all  rational 
values  of  y,  positive  or  negative.  In  other  words  the  equations 


y  =  logic,  x  =  ey 


(2) 
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are  consequences  of  one  another  so  long  as  y  is  rational  and  ey  has  its 
positive  value.  At  present  we  have  not  given  any  definition  of  a  power  such 
as  ev  in  which  the  index  is  irrational,  and  the  function  ey  is  defined  for 
rational  values  of  y  only. 


Example.  Prove  that  2  <  e  <  3.  [In  the  first  place  it  is  evident  that 


and  so  2  <  e.  Also 


<  1, 


du 


u 


du 

2  +  u 


Jo  4 -«2 


>  1, 


so  that  e  <  3.] 


204.  The  exponential  function.  We  now  define  the  exponential 
function  ey  for  all  real  values  of  y  as  the  inverse  of  the  logarithmic  function. 
In  other  words  we  write 

x  =  ey 

if  y  =  logx. 

We  saw  that,  as  x  varies  from  0  towards  oo,  y  increases  steadily,  in  the 
stricter  sense,  from  —  cx>  towards  oo.  Thus  to  one  value  of  x  corresponds 
one  value  of  y,  and  conversely.  Also  y  is  a  continuous  function  of  x,  and  it 
follows  from  §  109  that  x  is  likewise  a  continuous  function  of  y. 

It  is  easy  to  give  a  direct  proof  of  the  continuity  of  the  exponential  function. 
For  if  x  =  ey  and  x  +  £  =  ey+y  then 

[x+*  dt 

n  =  L  T 

Thus  \y\  is  greater  than  £/(x  +  £)  if  £  >  0,  and  than  |£|/x  if  £  <  0;  and  if  rj  is 
very  small  £  must  also  be  very  small. 

Thus  ey  is  a  positive  and  continuous  function  of  y  which  increases 
steadily  from  0  towards  oo  as  y  increases  from  —  oo  towards  oo.  More¬ 
over  ey  is  the  positive  ytli  power  of  the  number  e,  in  accordance  with 
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the  elementary  definitions,  whenever  y  is  a  rational  number.  In  particular 
ey  =  1  when  y  =  0.  The  general  form  of  the  graph  of  ey  is  as  shown  in 
Fig.  53. 


205.  The  principal  properties  of  the  exponential  function. 

(1)  If  x  =  ey,  so  that  y  =  logo;,  then  dy/dx  =  1/x  and 


Thus  the  derivative  of  the  exponential  function  is  equal  to  the  function 
itself.  More  generally,  if  x  =  eay  then  dx/dy  =  aeay . 

(2)  The  exponential  function  satisfies  the  functional  equation 

f(y  +  z)  =  f(y)f(z). 

This  follows,  when  y  and  z  are  rational,  from  the  ordinary  rules  of 
indices.  If  y  or  z,  or  both,  are  irrational  then  we  can  choose  two  sequences 
yi ,  y2,  •  •  • ,  yn,  ■  ■  ■  and  z±,  z2,  . . . ,  zni  ...  of  rational  numbers  such  that 
lim  yn  =  y.  lim  zn  =  z.  Then,  since  the  exponential  function  is  continuous, 
we  have 

ey  x  e2  =  lime2'"  x  lim eZn  =  limes“+z"  =  ev+z . 

In  particular  ey  x  e~y  —  e°  —  1,  or  e~y  =  l/ey . 
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We  may  also  deduce  the  functional  equation  satisfied  by  ey  from  that 
satisfied  by  logx.  For  if  y\  =  log^i,  1/2  =  logX2,  so  that  x\  =  eyi,  X2  =  eV2, 
then  yi  +  ]j2  —  logon  +  log  £2  =  log x \ X2  and 

g  2/1 +2/2  _  glog  3-1X2  _  XlX2  —  g22l  x  g222_ 


Examples  LXXXV.  1.  If  dx/dy  =  ax  then  x  =  Keay,  where  K  is  a 
constant. 

2.  There  is  no  solution  of  the  equation  f(y  +  z)  =  f(y)f(z)  fundamentally 
distinct  from  the  exponential  function.  [We  assume  that  f(y)  has  a  differential 
coefficient.  Differentiating  the  equation  with  respect  to  y  and  z  in  turn,  we 
obtain 

f\y  +  z)  =  f'(y)f(z),  f'(y  +  z)  =  f{y)f{z) 

and  so  f'(y)/f{y )  =  f(z)/f(z),  and  therefore  each  is  constant.  Thus  if  x  =  f(y) 
then  dx/dy  =  ax,  where  a  is  a  constant,  so  that  x  =  Keay  (Ex.  1).] 

3.  Prove  that  ( eay  —  l)/y  — >  a  as  y  — >  0.  [Applying  the  Mean  Value 
Theorem,  we  obtain  eay  —  1  =  ayeart,  where  0  <  \r]\  <  \y\.] 


206.  (3)  The  function  ey  tends  to  infinity  with  y  more  rapidly  than 

any  power  of  y,  or 

lim  ya/ey  =  lim  e~vya  =  0 
as  y  — *  00,  for  all  values  of  a  however  great. 

We  saw  that  {\og  x)/x^  — )■  0  as  x  — *  00,  for  any  positive  value  of  j3 
however  small.  Writing  a  for  1//3,  we  see  that  (loga:)a/a;  — >  0  for  any 
value  of  a  however  large.  The  result  follows  on  putting  x  =  ey .  It  is  clear 
also  that  e7y  tends  to  00  if  7  >  0,  and  to  0  if  7  <  0,  and  in  each  case  more 
rapidly  than  any  power  of  y. 

From  this  result  it  follows  that  we  can  construct  a  ‘scale  of  infinity’  similar 
to  that  constructed  in  §  202,  but  extending  in  the  opposite  direction;  i.e.  a  scale 
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of  functions  which  tend  to  oo  more  and  more  rapidly  as  x  — >  oo.*  The  scale  is 


x ,  x2,  x3, 


2x 


where  of  course  e®2,  . . . ,  e£X ,  . . .  denote  e^x2\  . . . ,  e^£X\ _ 

The  reader  should  try  to  apply  the  remarks  about  the  logarithmic  scale, 
made  in  §  202  and  Exs.  LXXXIV,  to  this  ‘exponential  scale’  also.  The  two  scales 
may  of  course  (if  the  order  of  one  is  reversed)  be  combined  into  one  scale 


. ..  log  log  x,  ...  logx,  ...  x,  ...  ex,  ...  ee  ,  . . . . 


207.  The  general  power  ax.  The  function  ax  has  been  defined  only 
for  rational  values  of  x,  except  in  the  particular  case  when  a  =  e.  We  shall 
now  consider  the  case  in  which  a  is  any  positive  number.  Suppose  that  x  is 
a  positive  rational  number  p/q.  Then  the  positive  value  y  of  the  power  ap^q 
is  given  by  yq  =  ap ;  from  which  it  follows  that 

q  log  y  =  p  log  a,  log  y  —  (p/q)  log  a  =  x  log  a, 


and  so 

y  =  exloga. 

We  take  this  as  our  definition  of  ax  when  x  is  irrational.  Thus 
10^  =  e^10®10.  It  is  to  be  observed  that  ax,  when  x  is  irrational,  is 
defined  only  for  positive  values  of  a,  and  is  itself  essentially  positive;  and 
that  log  ax  =  x  log  a.  The  most  important  properties  of  the  function  ax 
are  as  follows. 

*The  exponential  function  was  introduced  by  inverting  the  equation  y  =  logx  into 
x  =  ey:  and  we  have  accordingly,  up  to  the  present,  used  y  as  the  independent  and  x  as 
the  dependent  variable  in  discussing  its  properties.  We  shall  now  revert  to  the  more 
natural  plan  of  taking  x  as  the  independent  variable,  except  when  it  is  necessary  to 
consider  a  pair  of  equations  of  the  type  y  =  logo;,  x  =  ey  simultaneously,  or  when  there 
is  some  other  special  reason  to  the  contrary. 
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(1)  Whatever  value  a  may  have,  ax  x  ay  =  ax+y  and  ( ax)y  =  axy .  In 
other  words  the  laws  of  indices  hold  for  irrational  no  less  than  for  rational 
indices.  For,  in  the  first  place, 

ax  x  ay  _  gX  log  a  x  log  a  _  e(x+y)\oga  _ 

and  in  the  second 


(nx\y  —  eyl°SaX  —  £xyloga  _  axy 

(2)  If  a  >  1  then  ax  =  exXoga  =  eax,  where  a  is  positive.  The  graph 
of  ax  is  in  this  case  similar  to  that  of  ex,  and  ax  — >•  cx)  as  x  — >  oo,  more 
rapidly  than  any  power  of  x. 

If  a  <  1  then  ax  =  exl°sa  =  e~Px,  where  (3  is  positive.  The  graph  of  ax 
is  then  similar  in  shape  to  that  of  ex,  but  reversed  as  regards  right  and  left, 
and  ax  — >  0  as  x  — >  oo,  more  rapidly  than  any  power  of  1  jx. 

(3)  ax  is  a  continuous  function  of  x,  and 

Dxax  =  Dxexloga  =  e'Tlogalog  a  =  a^loga. 

(4)  ax  is  also  a  continuous  function  of  a,  and 

Daax  =  Daexl°sa  =  exloga(x/a )  =  xax~l. 

(5)  ( ax  —  l)/x  — >  log  a  as  x  — >  0.  This  of  course  is  a  mere  corollary 
from  the  fact  that  Dxax  =  ax  log  a,  but  the  particular  form  of  the  result 
is  often  useful;  it  is  of  course  equivalent  to  the  result  (Ex.  LXXXV.  3)  that 
(■ eax  —  l)/x  — >  a  as  x  — »  0. 

In  the  course  of  the  preceding  chapters  a  great  many  results  involving  the 
function  ax  have  been  stated  with  the  limitation  that  x  is  rational.  The  definition 
and  theorems  given  in  this  section  enable  us  to  remove  this  restriction. 

208.  The  representation  of  ex  as  a  limit.  In  Ch.  IV,  §  73,  we 

proved  that  {1  +  (l/n)}n  tends,  as  n  — >  oo,  to  a  limit  which  we  denoted 
provisionally  by  e.  We  shall  now  identify  this  limit  with  the  number  e  of 
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the  preceding  sections.  We  can  however  establish  a  more  general  result, 
viz.  that  expressed  by  the  equations 


lim 

n— >oo 


lim  (l  -  -)  "  =  ex. 

n—>oo  V  n  / 


(1) 


As  the  result  is  of  very  great  importance,  we  shall  indicate  alternative  lines 
of  proof. 

(1)  Since 


d 

dt 


log(l  +  xt) 


x 

1  +  xt' 


it  follows  that 


lim 

h- 5>0 


log(l  +  xh ) 
h 


=  X. 


If  we  put  h  =  l/£,  we  see  that 


lim£  log  ^1  +  —  J  =  x 

as  £  — >  oo  or  £  — >  —  oo.  Since  the  exponential  function  is  continuous  it 
follows  that 

X  _|_  —  e^log{l+(a:/C}  ex 

as  £  — »  oo  or  £  — >•  —  oo:  i.e.  that 

x\ ^  /  xx  ^ 


£—>•00 


t 


lim  1  H —  =  lim  1  +  —  =  e 


£— >•— oo 


t 


(2) 


If  we  suppose  that  £  — >■  00  or  £  — *  —  00  through  integral  values  only,  we 
obtain  the  result  expressed  by  the  equations  (1). 

(2)  If  n  is  any  positive  integer,  however  large,  and  x  >  1,  we  have 


dt 


rx  dt 
<  I  -  < 


dt 


or 


J 1  t^l/n)  -  ^  t  -J1  tl-(l/n)> 
n(l  —  x~1/n)  <  logx  <  n(x1^n  —  1). 


(3) 
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Writing  y  for  logx,  so  that  y  is  positive  and  x  =  ey, 
transformations , 


1  H — 
n 


<  x  < 


we  obtain,  after  some  simple 


—n 

(4) 


Now  let 


,  .  y  ,  y  i 
i  +  -  =  vi,  1  —  =  — • 

n  n  r]2 


Then  0  <  r)\  <  772 ,  at  any  rate  for  sufficiently  large  values  of  n;  and,  by  (9)  of 
§  74, 

ri2~Vi  <  nV2  1(V2  -  m)  =  y2V 2/n, 


which  evidently  tends  to  0  as  n  — >  00.  The  result  now  follows  from  the  inequal¬ 
ities  (4).  The  more  general  result  (2)  may  be  proved  in  the  same  way,  if  we 
replace  1/n  by  a  continuous  variable  h. 


209.  The  representation  of  logx  as  a  limit.  We  can  also  prove  (cf. 
§  75)  that 

lirn  n(l  —  x -1/Tl)  =  limn(x1/n  —  1)  =  logx. 

For 

n{xl/n  -  1)  -  n(  1  -  x-1^)  =  n(x1/n  -  1)(1  -  x '1/n), 

which  tends  to  zero  as  n  — >  00,  since  n(x1//n  —  1)  tends  to  a  limit  (§  75)  and 
x~1/n  to  1  (Ex.  XXVII.  10).  The  result  now  follows  from  the  inequalities  (3)  of 
S  208. 


Examples  LXXXVI.  1.  Prove,  by  taking  y  =  1  and  n  =  6  in  the 
inequalities  (4)  of  §  208,  that  2.5  <  e  <  2.9. 

2.  Prove  that  if  t  >  1  then  (t1//n  —  f-1//n)/(t  —  t-1)  <  1/n,  and  so  that  if 
x  >  1  then 

f_^< 1  r(t-i)dt=i  (x+i-2). 

Ji  U-f1/71)  ,/j  n  J1  \  t )  t  n  \  x  ) 

Hence  deduce  the  results  of  §  209. 

3.  If  £n  is  a  function  of  n  such  that  n£n  — >  l  as  n  — >  00,  then  (1  +  £n)n  ->  el. 
[Writing  nlog(l  +  £n)  in  the  form 

f <n\  log(l  +  jn) 

V  l  )  Cn  ’ 
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and  using  Ex.  Lxxxn.  4,  we  see  that  nlog(l  +  £n)  — >  l.] 

4.  If  n£n  — >  oo,  then  (1  +  £ n)n  — »•  oo;  and  if  1  +  >  0  and  n^n  -»  — oo, 

then 

(i  +  Un^o. 


5.  Deduce  from  (1)  of  §  208  the  theorem  that  ey  tends  to  infinity  more 
rapidly  than  any  power  of  y. 


210.  Common  logarithms.  The  reader  is  probably  familiar  with 
the  idea  of  a  logarithm  and  its  use  in  numerical  calculation.  He  will  re¬ 
member  that  in  elementary  algebra  loga  x,  the  logarithm  of  x  to  the  base  a, 
is  defined  by  the  equations 


x  =  ay ,  y  =  logax. 

This  definition  is  of  course  applicable  only  when  y  is  rational,  though  this 
point  is  often  passed  over  in  silence. 

Our  logarithms  are  therefore  logarithms  to  the  base  e.  For  numerical 
work  logarithms  to  the  base  10  are  used.  If 

y  =  log  x  =  loge  x,  2  =  log10  x , 

then  x  =  ey  and  also  x  =  102  =  ezlogl°,  so  that 

iogio^  =  (ioge^O/Qoge  10)' 

Thus  it  is  easy  to  pass  from  one  system  to  the  other  when  once  loge  10  has 
been  calculated. 

It  is  no  part  of  our  purpose  in  this  book  to  go  into  details  concerning 
the  practical  uses  of  logarithms.  If  the  reader  is  not  familiar  with  them  he 
should  consult  some  text-book  on  Elementary  Algebra  or  Trigonometry.* 

*See  for  example  Chrystal’s  Algebra ,  vol.  i,  ch.  XXI.  The  value  of  loge  10  is  2.302  . . . 
and  that  of  its  reciprocal  .434 .... 


[IX  :  210]  THE  LOGARITHMIC  AND  EXPONENTIAL  FUNCTIONS 


461 


Examples  LXXXVII.  1.  Show  that 

Dxeax  cos  bx  =  reax  cos (bx  +  9),  Dxeax  sin  bx  =  reax  sin (bx  +  9) 

where  r  =  V  a2  +  b2,  cos  9  =  a/r,  sin$  =  b/r.  Hence  determine  the  nth  deriva¬ 
tives  of  the  functions  eax  cosbx,  eaxsmbx,  and  show  in  particular  that 
D”eax  =  aneax. 

2.  Trace  the  curve  y  =  e~ax  s'mbx,  where  a  and  b  are  positive.  Show  that 
y  has  an  infinity  of  maxima  whose  values  form  a  geometrical  progression  and 
which  lie  on  the  curve 

y=  _ = _ =  e~ax. 

(Math.  Trip.  1912.) 

3.  Integrals  containing  the  exponential  function.  Prove  that 


eax  cos  bx  dx  = 


/ 


eax  sin  bx  dx  = 


a  cos  bx  +  b  sin  bx 
a 2  +  b2 

a  sin  bx  —  b  cos  bx 
a 2  +  b2 


[Denoting  the  two  integrals  by  I ,  J,  and  integrating  by  parts,  we  obtain 
al  =  eax  cos  bx  +  bJ,  aJ  =  eax  sin  bx  —  bl. 


Solve  these  equations  for  I  and  J.] 

4.  Prove  that  the  successive  areas  bounded  by  the  curve  of  Ex.  2  and  the 
positive  half  of  the  axis  of  x  form  a  geometrical  progression,  and  that  their  sum 

is 

b  1  +  e~an/b 

a2  +  b2  1  -  e~anlh ' 

5.  Prove  that  if  a  >  0  then 

a  r°°  h 

e~ax  cos  bx  dx  =  — - ,  /  e~ax  sin  bx  dx  =  — - - . 

a2  +  b2  J  o  a2  +  b2 

6.  If  In  =  J  eaxxndx  then  aln  =  eaxxn  —  nIn-\.  [Integrate  by  parts.  It 

follows  that  In  can  be  calculated  for  all  positive  integral  values  of  n.] 
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7.  Prove  that,  if  n  is  a  positive  integer,  then 


and 


dx  =  n\e  ^ 


e 

2! 


e\ 

n\ ) 


dx  =  n\. 


8.  Show  how  to  find  the  integral  of  any  rational  function  of  ex.  [Put 
x  =  log  u,  when  ex  =  u,  dx/du  =  1/u,  and  the  integral  is  transformed  into  that 
of  a  rational  function  of  u.] 

9.  Integrate 

e2x 

( c2ex  +  a2e~x)(c2ex  +  b2e~x)  ’ 

distinguishing  the  cases  in  which  a  is  and  is  not  equal  to  b. 

10.  Prove  that  we  can  integrate  any  function  of  the  form  P(x,  eax ,  ebx, . . . ), 
where  P  denotes  a  polynomial.  [This  follows  from  the  fact  that  P  can  be  ex¬ 
pressed  as  the  sum  of  a  number  of  terms  of  the  type  Axmekx ,  where  m  is  a 
positive  integer.] 

11.  Show  how  to  integrate  any  function  of  the  form 


P(x,  eax,  ebx,  ...,  cos  lx,  cos  mx,  ...,  sin^x,  sinmx,  ...). 


12.  Prove  that  /  e  xR(x )  dx,  where  A  >  0  and  a  is  greater  than  the 
J  a 

greatest  root  of  the  denominator  of  R(x),  is  convergent.  [This  follows  from  the 
fact  that  eXx  tends  to  infinity  more  rapidly  than  any  power  of  x.] 

/°°  2 

e~Xx  +fMX  dx,  where  A  >  0,  is  convergent  for  all  values 

-OO 

POO 

of  i-i ,  and  that  the  same  is  true  of  /  e~Xx  +^xxn  dx,  where  n  is  any  positive 


integer. 

2  2  2  2 

14.  Draw  the  graphs  of  ex  ,  e~x  ,  xex,  xe~x ,  xex  ,  xe~x  ,  and  xlogx,  de¬ 
termining  any  maxima  and  minima  of  the  functions  and  any  points  of  inflexion 
on  their  graphs. 

15.  Show  that  the  equation  eax  =  bx,  where  a  and  b  are  positive,  has  two 
real  roots,  one,  or  none,  according  as  b  >  ae,  b  =  ae,  or  b  <  ae.  [The  tangent  to 
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the  curve  y  =  eax  at  the  point  (£,  ea^)  is 

y  —  ea^  =  aea^(x  —  £), 

which  passes  through  the  origin  if  a£  =  1,  so  that  the  line  y  =  aex  touches  the 
curve  at  the  point  (1/a,  e).  The  result  now  becomes  obvious  when  we  draw  the 
line  y  =  bx.  The  reader  should  discuss  the  cases  in  which  a  or  b  or  both  are 
negative.] 

16.  Show  that  the  equation  ex  =  1  +  x  has  no  real  root  except  x  =  0,  and 
that  ex  =  1  T  x  T  \x2  has  three  real  roots. 

17.  Draw  the  graphs  of  the  functions 

log(x  +  V x2  +  1),  log  X^j  j  e~ax  cos2  bx, 
g-!1/^)2  yj\fx,  e~cotx,  e~cot2x. 

18.  Determine  roughly  the  positions  of  the  real  roots  of  the  equations 

log(x+i/x2~+T)  =  ^  ex  sin x  =  7,  ex2  sin.T  =  10,000. 

100  2  —  x  10,000 

19.  The  hyperbolic  functions.  The  hyperbolic  functions  coshx,* 
sinhx,  . . .  are  defined  by  the  equations 

cosh  x  =  \{ex  +  e~x ),  sinh  x  =  \{ex  —  e~x ), 
tanh.x  =  (sinh  x) /(cosh  x) ,  cothx  =  (cosh  x)/ (sinh  x), 
sech  x  =  1  /  (cosh  x) ,  cosech  x  =  1/  (sinh  x) . 

Draw  the  graphs  of  these  functions. 

20.  Establish  the  formulae 

cosh(— x)  =  coshx,  sinh(— x)  =  —  sinh  a;,  tanh(— x)  =  —  tanhx, 
cosh2  x  —  sinh2  x  =  1,  sech2  x  +  tanh2  x  =  1,  coth2  x  —  cosech2  x  =  1, 
cosh  2x  =  cosh2  x  +  sinh2  x.  sinh  2x  =  2  sinh  x  cosh  x, 

"“Hyperbolic  cosine’:  for  an  explanation  of  this  phrase  see  Hobson’s  Trigonometry , 
ch.  XVI. 
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cosh(x  +  y)  =  cosh  x  cosh  y  +  sinh  x  sinh  y, 
sinh(x  +  y)  =  sinh  x  cosh  y  +  cosh  x  sinh  y. 

21.  Verify  that  these  formulae  may  be  deduced  from  the  corresponding  for¬ 
mulae  in  cosx  and  sinx,  by  writing  coshx  for  cosx  and  i  sinh  x  for  sinx. 

[It  follows  that  the  same  is  true  of  all  the  formulae  involving  cos  nx  and 
sin  nx  which  are  deduced  from  the  corresponding  elementary  properties  of  cos  x 
and  sinx.  The  reason  of  this  analogy  will  appear  in  Ch.  X.] 

22.  Express  coshx  and  sinhx  in  terms  (a)  of  cosh2x  ( b )  of  sinh2x.  Discuss 

any  ambiguities  of  sign  that  may  occur.  (Math.  Trip.  1908.) 

23.  Prove  that 


Dx  cosh  x  =  sinh  x,  Dx  sinh  x  =  cosh  x, 

Dx  tanh  x  =  sech2  x,  Dx  coth  x  =  —  cosech2  x, 

Dx  sech  x  =  —  sech  x  tanh  x,  Dx  cosech  x  =  —  cosech  x  coth  x, 

Dx  log  cosh  x  =  tanh  x,  Dx  log  |  sinh  x|  =  coth  x, 

Dx  arc  tan  ex  =  \  sech  x,  Dx  log  |  tanh  \x\=  cosech  x. 

[All  these  formulae  may  of  course  be  transformed  into  formulae  in  integra¬ 
tion.] 

24.  Prove  that  coshx  >  1  and  —  1  <  tanhx  <  1. 

25.  Prove  that  if  y  =  coshx  then  x  =  log{y  ±  \Jy 2  —  1},  if  y  =  sinhx  then 
x  =  logjy  +  y fy2  +  1},  and  if  y  =  tanhx  then  x  =  |log{(l  +  y)/(  1  —  y)}. 
Account  for  the  ambiguity  of  sign  in  the  first  case. 

26.  We  shall  denote  the  functions  inverse  to  coshx,  sinhx,  tanhx  by 
arg coshx,  arg sinhx,  arg tanhx.  Show  that  arg coshx  is  defined  only  when 
x  ^  1,  and  is  in  general  two- valued,  while  arg  sinhx  is  defined  for  all  real  values 
of  x,  and  arg  tanhx  when  —  1  <  x  <  1,  and  both  of  the  two  latter  functions  are 
one-valued.  Sketch  the  graphs  of  the  functions. 

27.  Show  that  if  —  <  x  <  and  y  is  positive,  and  cos x coshy  =  1,  then 


y  =  log(sec  x  +  tanx),  Dxy  =  secx,  Dyx  =  sec\iy. 

dx 


28.  Prove  that  if  a  >  0  then 


Vx2  + ' 


=  argsinh(x/a),  and 


dx 


y/x2  — 


is  equal  to  argcosh(x/a)  or  to  —  argcosh(— x/a),  according  as  x  >  0  or  x  <  0. 
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f  dx 

29.  Prove  that  if  a  >  0  then  /  =  is  equal  to  —(1/a)  argtanh(x7a)  or 

J  xz  —  a- 

to  —  (1/a)  argcoth(x/a),  according  as  |x|  is  less  than  or  greater  than  a.  [The 
results  of  Exs.  28  and  29  furnish  us  with  an  alternative  method  of  writing  a  good 
many  of  the  formulae  of  Ch.  VI.] 

30.  Prove  that 

=  2  log{  y/x  —  a  +  yj  x  —  b} 

x  —  a)  (x  —  b) 

=====  =  -2  log  {y/a-x+  Vb-x} 
a  —  x)(b  —  x) 

/dx  lx  —  a 

— .  =  2  arc  tan  i  /  - - 

y/(x  —  a)(b  —  x)  V  b  —  x 


(a  <  b  <  x), 
(x  <  a  <  b), 
(a  <  x  <  b). 


31.  Prove  that 

f  x  log(l  +  |x)  dx  =  |  —  |  log  |  <  |  [  x2  dx  =  g. 

Jo  ~  J  0 

(Math.  Trip.  1913.) 

32.  Solve  the  equation  a  cosh  x  +  6sinhx  =  c,  where  c  >  0,  showing  that  it 
has  no  real  roots  if  b2  +  c2  —  a2  <  0,  while  if  b2  +  c2  —  a2  >  0  it  has  two,  one,  or 
no  real  roots  according  as  a  +  b  and  a  —  b  are  both  positive,  of  opposite  signs, 
or  both  negative.  Discuss  the  case  in  which  b2  +  c2  —  a2  =  0. 

33.  Solve  the  simultaneous  equations  cosh  x  cosh  y  =  a,  sinli  x  sinh  y  =  b. 

34.  xl/x  — >  1  as  x  — >  oo.  [For  xx!x  =  e(log^/x,  and  (log x)/x  — >  0.  Cf. 
Ex.  xxvii.  11.]  Show  also  that  the  function  x l^x  has  a  maximum  when  x  =  e, 
and  draw  the  graph  of  the  function  for  positive  values  of  x. 

35.  xx  — >  1  as  x  — »•  +0. 

36.  If  {f(n  +  l)}/{/(n)}  — >  l,  where  l  >  0,  as  n  -*  oo,  then  y/fjn)  — >  l. 
[For  log f(n  +  1)  —  log f(n)  -H  log l,  and  so  (l/n)log/(n)  — >  log/  (Ch.  IV, 
Misc.  Ex.  27).] 

37.  yfn\/n  — >  1/e  as  n  — >  oo. 

[If  f(n)  =  n~nn\  then  {/(n  +  l)}/{/(n)}  =  {1  +  (l/n)}_n  — >  1/e.  Now  use 
Ex.  36.] 

38.  ^(2 n)\/(n\)2  — >  4  as  n  — >  oo. 
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39.  Discuss  the  approximate  solution  of  the  equation  ex  =  x1,000’000. 

[It  is  easy  to  see  by  general  graphical  considerations  that  the  equation  has 
two  positive  roots,  one  a  little  greater  than  1  and  one  very  large,*  and  one 
negative  root  a  little  greater  than  —1.  To  determine  roughly  the  size  of  the 
large  positive  root  we  may  proceed  as  follows.  If  ex  =  x1,000>000  then 


x  =  106logx,  logic 


13.82  + log  logic,  log  log  x 


2.63  + log 


/  log  logaA 
V  13.82  ) 


roughly,  since  13.82  and  2.63  are  approximate  values  of  log  106  and  log  log  106 
respectively.  It  is  easy  to  see  from  these  equations  that  the  ratios  logx  :  13.82 
and  log  logx  :  2.63  do  not  differ  greatly  from  unity,  and  that 

x  =  106(13.82  +  log  logx)  =  106(13.82  +  2.63)  =  16,450,000 


gives  a  tolerable  approximation  to  the  root,  the  error  involved  being  roughly 
measured  by  106(loglogx  —  2.63)  or  (106loglogx)/13.82  or  (106  x  2.63) /13.82, 
which  is  less  than  200,000.  The  approximations  are  of  course  very  rough,  but 
suffice  to  give  us  a  good  idea  of  the  scale  of  magnitude  of  the  root.] 

40.  Discuss  similarly  the  equations 

ex  =  l^OOTOOx1’000’000,  ex2  =  x1’000-000-000. 


211.  Logarithmic  tests  of  convergence  for  series  and  integrals. 

We  showed  in  Ch.  VIII  (§§  175  et  seq .)  that 


(a  >  0) 


are  convergent  if  s  >  1  and  divergent  if  s  ^  1.  Thus  ^(1/ n)  is  divergent, 
but  J+n+1_“  is  convergent  for  all  positive  values  of  a. 

*The  phrase  ‘very  large’  is  of  course  not  used  here  in  the  technical  sense  explained  in 
Ch.  IV.  It  means  ‘a  good  deal  larger  than  the  roots  of  such  equations  as  usually  occur 
in  elementary  mathematics’.  The  phrase  ‘a  little  greater  than’  must  be  interpreted 
similarly. 
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We  saw  however  in  §  200  that  with  the  aid  of  logarithms  we  can  con¬ 
struct  functions  which  tend  to  zero,  as  n  — y  oo,  more  rapidly  than  1/n,  yet 
less  rapidly  than  n~1_a,  however  small  a  may  be,  provided  of  course  that 
it  is  positive.  For  example  l/{n\ogn)  is  such  a  function,  and  the  question 
as  to  whether  the  series 


1 

n  log  n 


is  convergent  or  divergent  cannot  be  settled  by  comparison  with  any  series 
of  the  type  n~s ■ 

The  same  is  true  of  such  series  as 


1 

n(logn)2  ’ 


log  log  n 
nydogn 


It  is  a  question  of  some  interest  to  find  tests  which  shall  enable  us  to  decide 
whether  series  such  as  these  are  convergent  or  divergent;  and  such  tests  are 
easily  deduced  from  the  Integral  Test  of  §  174. 

For  since 


Lb  (log  x)1  s 


1  —  s 

x(logx)s  ’ 


Dx  log  log  x 


1 

X  log  X  ’ 


we  have 


dx 

x(logx)s 


(logQ1  5  ~  (log a)1  * 

1  —  s 


dx 

x  logx 


log  log  f  -  log  log  a, 


if  a  >  1.  The  first  integral  tends  to  the  limit  —(log  a)l~s / (1  —  s)  as  £  — >  oo, 
if  s  >  1,  and  to  oo  if  s  <  1.  The  second  integral  tends  to  oo.  Hence  the 
series  and  integral 


X 


1 

n(logn)s  ’ 


dx 


x(logx)s  ’ 


where  n0  and  a  are  greater  than  unity,  are  convergent  if  s  >  1,  divergent 
if  1. 
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It  follows,  of  course,  that  ^  0(n)  is  convergent  if  0(n)  is  positive  and 
less  than  K/{n(logn)s},  where  s  >  1,  for  all  values  of  n  greater  than 
some  definite  value,  and  divergent  if  4>{n)  is  positive  and  greater  than 
K/  (nlogn)  for  all  values  of  n  greater  than  some  definite  value.  And  there 
is  a  corresponding  theorem  for  integrals  which  we  may  leave  to  the  reader. 


Examples  LXXXVIII. 


X 


1 

n(logn)2  ’ 


X 


1.  The  series 

(log  n) 100  n2  —  1  1 

nioi/ioo  ’  n2  +  1  n(log?i)7/6 


are  convergent.  [The  convergence  of  the  first  series  is  a  direct  consequence  of  the 
theorem  of  the  preceding  section.  That  of  the  second  follows  from  the  fact  that 
(log  n) 100  is  less  than  vP  for  sufficiently  large  values  of  n,  however  small  /3  may 
be,  provided  that  it  is  positive.  And  so,  taking  /3  =  1/200,  (logn)100n~101/100 
is  less  than  n~201/200  for  sufficiently  large  values  of  n.  The  convergence  of  the 
third  series  follows  from  the  comparison  test  at  the  end  of  the  last  section.] 

2.  The  series 


y _ I _ 

n(logn)6/7  ’ 


X 


1 

n100/101  ("fog  n)  100  ’ 


X 


nlogn 

(nlog?r)2  +  1 


are  divergent. 

3.  The  series 


E(logn)p  ^  (logn)p(loglogn)9  ^  (loglogn)p 
n1+s  ’  n1+s  ’  n(logn)1+s  ’ 


where  s  >  0,  are  convergent  for  all  values  of  p  and  q ;  similarly  the  series 


y _ I _ , 

n1-s(logn)P’ 


X 


1 

n1-s(logn)P(log  logn)1?  ’ 


X 


1 

n(log  n) 1-5  (log  log  n)P 


are  divergent. 

4.  The  question  of  the  convergence  or  divergence  of  such  series  as 


X 


1 

n  log  n  log  log  n  ’ 


X 


log  log  log  n 
n  log  ny/  log  logn 


cannot  be  settled  by  the  theorem  of  p.  467,  since  in  each  case  the  function  under 
the  sign  of  summation  tends  to  zero  more  rapidly  than  l/(n  log  n)  yet  less  rapidly 


[IX  :  211]  THE  LOGARITHMIC  AND  EXPONENTIAL  FUNCTIONS 


469 


than  n_1(logn)_1~a,  where  a  is  any  positive  number  however  small.  For  such 
series  we  need  a  still  more  delicate  test.  The  reader  should  be  able,  starting 
from  the  equations 


D. rOogfcX)1  S 


Dx  logfc+i  X 


1  —  s 

X  log  X  log2  X  . . .  logfe_1  x(logfc  x)s  ’ 

1 

X  log  X  log2  X  .  .  .  log,,.!  X  logfc  X  ’ 


where  log2  x  =  log  log  x,  log3x  =  log  log  log  x,  . . . ,  to  prove  the  following  theo¬ 
rem:  the  series  and  integral 


X 


1 

n  log  n  log2  n  . . .  logfc_!  n(logfc  n)s  ’ 


dx 

x  log  x  log2  x  . . .  logk_1  x(logk  x)s 


are  convergent  if  s  >  1  and  divergent  if  s  X  1 ,  no  and  a  being  any  numbers 
sufficiently  great  to  ensure  that  logfc  n  and  logfc  x  are  positive  when  n  X  no 
or  x  X  a.  These  values  of  no  and  a  increase  very  rapidly  as  k  increases: 
thus  logs  >  0  requires  x  >  1,  log2  x  >  0  requires  x  >  e,  log3.T  >  0  requires 
x  >  ee,  and  so  on;  and  it  is  easy  to  see  that  ee  >  10,  ee<!  >  e10  >  20,000, 

gee<!  g20,000  ^  -^q8000 

The  reader  should  observe  the  extreme  rapidity  with  which  the  higher  ex¬ 
ponential  functions,  such  as  eeX  and  ee<:  ,  increase  with  x.  The  same  remark  of 

x  ax 

course  applies  to  such  functions  as  aa  and  aa  ,  where  a  has  any  value  greater 
than  unity.  It  has  been  computed  that  99)  has  369,693,100  figures,  while  1010l° 
has  of  course  10,000,000,000.  Conversely,  the  rate  of  increase  of  the  higher  log¬ 
arithmic  functions  is  extremely  slow.  Thus  to  make  log  log  log  log  i>lwe  have 
to  suppose  x  a  number  with  over  8000  figures.* 

fa  If  /  1  \  1  s 

5.  Prove  that  the  integral  /  —  <  log  —  >  dx,  where  0  <  a  <  1,  is 


Jo  x  {  \x /  ) 

convergent  if  s  <  —  1,  divergent  if  s  X  —1.  [Consider  the  behaviour  of 


dx 


as  e  ->  +0.  This  result  also  may  be  refined  upon  by  the  introduction  of  higher 
logarithmic  factors.] 


See  the  footnote  to  p.  450. 
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1 


6.  Prove  that  /  —  <  log  (  —  >  dx  has  no  meaning  for  any  value  of  s. 

Jo  x  {  \xj  J 

[The  last  example  shows  that  s  <  —  1  is  a  necessary  condition  for  convergence 
at  the  lower  limit:  but  {log(l/x)}s  tends  to  oo  like  (1  —  x)s,  as  x  — >  1  —  0,  if  s  is 
negative,  and  so  the  integral  diverges  at  the  upper  limit  when  s  <  —1.] 

7.  The  necessary  and  sufficient  conditions  for  the  convergence  of 

'1' 


—  1 


log 


dx  are  a  >  0,  s  >  —1. 


Examples  LXXXIX.  1.  Euler’s  limit.  Show  that 


0(n) 


1  1 

1+2+3+ 


+ 


n  —  1 


logn 


tends  to  a  limit  7  as  n  — >  00,  and  that  0  <  7  ^  1.  [This  follows  at  once 
from  §  174.  The  value  of  7  is  in  fact  .577 . . . ,  and  7  is  usually  called  Euler’s 
constant.] 

2.  If  a  and  6  are  positive  then 


111  1 

a  a  +  b  a  +  2b  a  +  (n  —  1)6 


1 

6 


log(a  +  nb) 


tends  to  a  limit  as  n  — >  00. 

3.  If  0  <  s  <  1  then 

4>(n)  =  1  +  2"s  +  3"s  +  •  •  •  +  (n  -  l)~s  - 


tends  to  a  limit  as  n  — >  00. 

4.  Show  that  the  series 


1  1  1 

1  2(1  +  2)  3(1  +  \  +  |) 


is  divergent.  [Compare  the  general  term  of  the  series  with  l/(nlogn).]  Show 
also  that  the  series  derived  from  ^  n~s ,  in  the  same  way  that  the  above  series 
is  derived  from  X^(Vn)>  is  convergent  if  s  >  1  and  otherwise  divergent. 

5.  Prove  generally  that  if  un  is  a  series  of  positive  terms,  and 


Sn  =  Ul  +  U2  H - 1 -Un 
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then  ^2(un/ sn-i)  is  convergent  or  divergent  according  as  J2un  is  convergent  or 
divergent.  [If  Ylun  is  convergent  then  sn_i  tends  to  a  positive  limit  l,  and  so 
Y^{un/sn- 1)  is  convergent.  If  Ylun  is  divergent  then  sn-\  — >  oo,  and 

Un/sn- 1  >  log{l  +  (un/sn- 1)}  =  log(sn/sn_i ) 

(Ex.  LXXXII.  1);  and  it  is  evident  that 

log(s2/si)  +  log(s3/s2)  H - 1-  log(sn/sn_i)  =  log(sn/si) 

tends  to  oo  as  n  — >  oo.[ 

6.  Prove  that  the  same  result  holds  for  the  series  Yl(un/ sn)-  [The  proof  is 
the  same  in  the  case  of  convergence.  If  Ylun  is  divergent,  and  un  <  sn_i  from 
a  certain  value  of  n  onwards,  then  sn  <  2sn_i,  and  the  divergence  of  YKun/sn ) 
follows  from  that  of  Y2(un/sn- 1)-  If  on  the  other  hand  un  ^  sn_i  for  an  infinity 
of  values  of  n,  as  might  happen  with  a  rapidly  divergent  series,  then  un/ sn  ^  ^ 
for  all  these  values  of  n.] 

7.  Sum  the  series  1  —  \  +  |  —  . . . .  [We  have 

l+\+-  •  -+^  =  log(2n+l)+7+en,  2  Q  +  |  +  ■  ■  ■  +  =  \og(n+l)+j+e'n , 

by  Ex.  1,  7  denoting  Euler’s  constant,  and  en,  e'n  being  numbers  which  tend  to 
zero  as  n  — >  oo.  Subtracting  and  making  n  — >  oo  we  see  that  the  sum  of  the 
given  series  is  log  2.  See  also  §  213.] 

8.  Prove  that  the  series 

oscillates  finitely  except  when  (7  =  7,  when  it  converges. 


212.  Series  connected  with  the  exponential  and  logarithmic 
functions.  Expansion  of  ex  by  Taylor’s  Theorem.  Since  all  the 
derivatives  of  the  exponential  function  are  equal  to  the  function  itself,  we 
have 


ex  =  l  +  x+  —  +  •••  + 


x 


n—  1 


Xn 


(n  —  1)!  n\ 


Ox 
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where  0  <  6  <  1.  But  xn/n\  -A  0  as  n  — »  oo,  whatever  be  the  value  of  x 
(Ex.  xxvil.  12);  and  e6x  <  ex.  Hence,  making  n  tend  to  oo,  we  have 


x 


x 


ex  =  l  +  x+  —  +  •••  +  —  + 


2! 


nl 


(1) 


The  series  on  the  right-hand  side  of  this  equation  is  known  as  the  ex¬ 
ponential  series.  In  particular  we  have 


e=1+1+^+ 


H - :  +  •••; 

nl 


and  so 


i  +  i  +  l  +  ---  +  l  + 


x2  xn 

=  1  +  X  +  —  +  —  + 

2!  nl 


a  result  known  as  the  exponential  theorem.  Also 


ax  =  exloga  ~ 


=  i  +  (iioga)  +  Eh|h!  +  . 


2! 


(2) 


(3) 


(4) 


for  all  positive  values  of  a. 

The  reader  will  observe  that  the  exponential  series  has  the  property  of  repro¬ 
ducing  itself  when  every  term  is  differentiated,  and  that  no  other  series  of  powers 
of  x  would  possess  this  property:  for  some  further  remarks  in  this  connection 
see  Appendix  II. 

The  power  series  for  ex  is  so  important  that  it  is  worth  while  to  investigate 
it  by  an  alternative  method  which  does  not  depend  upon  Taylor’s  Theorem.  Let 

/y>2  rpfl 

En(x)  =  1  +  x  +  —  +  ■■•-!  r, 

2!  nl 

and  suppose  that  x  >  0.  Then 


l  +  ~r  =  l  +  n(-)  + 


n 


n 


n{n  —  1) 

1  •  2 


x-)\ 

nJ 


+ 


n(n  —  1) ...  1  / x\ 71 


n 


which  is  less  than  En(x).  And,  provided  n  >  x,  we  have  also,  by  the  binomial 
theorem  for  a  negative  integral  exponent, 


1  -  - 
n 


~n  _  /x\  n(n  +  l)  fx\2 

=  1  +  n  -  +  v  ’  1  ' 


n 


1  •  2 


(J  +'"  >En^' 
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Thus 


(I  +  =)■<*(*)  <(!-=)' 


But  (§  208)  the  first  and  last  functions  tend  to  the  limit  ex  as  n  — >  oo, 
and  therefore  En{x)  must  do  the  same.  From  this  the  equation  (1)  follows 
when  x  is  positive;  its  truth  when  x  is  negative  follows  from  the  fact  that 
the  exponential  series,  as  was  shown  in  Ex.  LXXXI.  7,  satisfies  the  functional 
equation  f(x)f(y)  =  f(x  +  y),  so  that  f(x)f(-x)  =  /( 0)  =  1. 


Examples  XC.  1.  Show  that 


cosh  x  =  l  +  —  +  —  +  ...,  sinh  x  =  x~l~:3[~,~:5[  + - 


2.  If  x  is  positive  then  the  greatest  term  in  the  exponential  series  is  the 
([a;]  +  l)-th,  unless  x  is  an  integer,  when  the  preceding  term  is  equal  to  it. 

3.  Show  that  n\  >  ( n/e)n .  [For  nn/n\  is  one  term  in  the  series  for  en.\ 

4.  Prove  that  en  =  (nn/n\)( 2  +  Si  +  £2),  where 

5l  =  rr^  +  (i+^)(i+2Z,)  +  ---’  52  =  (i-^)  +  (i-^)(i-2z,)  +  ..., 


and  v  =  1/n;  and  deduce  that  n!  lies  between  2 (n/e)n  and  2 (n  +  l)(n/e)n. 

5.  Employ  the  exponential  series  to  prove  that  ex  tends  to  infinity  more 
rapidly  than  any  power  of  x.  [Use  the  inequality  ex  >  xn/n\.] 

6.  Show  that  e  is  not  a  rational  number.  [If  e  =  p/q,  where  p  and  q  are 
integers,  we  must  have 


P 

Q 


1  1  1 
1  +  1+2!  +  3!+"'  +  d+'-- 


or,  multiplying  up  by  g!, 


I 


1\  _  _J_  1 

q'-J  q  + 1  (9 +  !)(«  + 2)  +'" 


and  this  is  absurd,  since  the  left-hand  side  is  integral,  and  the  right-hand  side 
less  than  {1  /{q  +  1)}  +  {1  /{q  +  l)}2  H - =  1  /q.} 
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7.  Sum  the  series  Y)  Pr(n ) — where  Pr(n )  is  a  polynomial  of  degree  r  in  n. 
o  nl 

[We  can  express  Pr(n)  in  the  form 


Aq  +  Ain  +  A-2n(n  —  !)  +  •••  +  Arn(n  —  1) . . .  (n  —  r  +  1), 


and 


OO  /yi  OO 

X 

[n) — -  = 
n! 

0  0 


/yi  I  v  _ _ _  /yt  /  L 

=a,£yt  +AiE 


+  •  •  •  +  Ar  yy 


x 


=  (Aq  +  A\x  +  A2X 2  +  •  •  •  +  Arxr)ex .] 


8.  Show  that 

OO  3  OO  4 

^  ^-xn  =  (x  +  3x2  +  x3)ex,  ^  +  7x 2  +  6x3  +  x4)ea:; 

1  n'  1  n- 

and  that  if  Sn  =  l3  +  23  +  •  •  •  +  n3  then 

OO  yi 

Sn  — — r  =  i(4x  +  14x2  +  8x3  +  x4)ex. 

^  n\ 

In  particular  the  last  series  is  equal  to  zero  when  x  =  —2.  (Math.  Trip.  1904.) 

9.  Prove  that  Y(n/n0  =  e,  Y(n2/n0  =  2e,  YXn3/nD  =  5e,  and  that 
YXnVra0>  where  k  is  any  positive  integer,  is  a  positive  integral  multiple  of  e. 

OO  ( _  1  A  rp71 

10.  Prove  that  V - '  =  { (x2  —  3x  +  3)ex  +  lx2  —  3)  lx2. 

1  (n  +  2)n!  1  2  J 

[Multiply  numerator  and  denominator  by  n  +  1,  and  proceed  as  in  Ex.  7.] 

11.  Determine  a,  b,  c  so  that  {(x  +  a) ex  +  (bx  +  c)}/x3  tends  to  a  limit  as 

bx  H-  c 

x  — >•  0,  evaluate  the  limit,  and  draw  the  graph  of  the  function  ex  H - . 

x  +  a 

12.  Draw  the  graphs  of  1  +  x,  1  +  x  +  ^ x 2,  1  +  x  +  ^x2  +  |x3,  and  compare 
them  with  that  of  ex. 

13.  Prove  that  e-*  —  1  +  x - -  +  •  •  •  —  (— l)n — -  is  positive  or  negative 

2!  n\ 

according  as  n  is  odd  or  even.  Deduce  the  exponential  theorem. 

14.  If 


X0  =  ex,  Xi  =  e*  -  1,  X2  =  ex  -  1  -  x,  X3  =  ex  -  1  -  x  -  (x2/2!),  . . . , 
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then  dXvjdx  =  X„_i .  Hence  prove  that  if  t  >  0  then 


X\ (t)  =  I  Xq  dxKte*,  Xzit)  =  J  X\dx  <  J  xex  dx  <  e*  J  xdx  =  —et1 


i  o 


tv 

and  generally  Xv(t)  <—et.  Deduce  the  exponential  theorem. 

15.  Show  that  the  expansion  in  powers  of  p  of  the  positive  root  of  x2+p  =  a 2 
begins  with  the  terms 

a{l  -  \p log  a  +  \p2  log  a(2  +  log  a)}. 

(Math.  Trip.  1909.) 


213.  The  logarithmic  series.  Another  very  important  expansion 
in  powers  of  x  is  that  for  log(l  +  x).  Since 


log(l  +  x) 


dt 

1+i’ 


and  1/(1  +  t)  =  1  —  t  +  t2  —  ...  if  t  is  numerically  less  than  unity,  it  is 
natural  to  expect*  that  log(l  +  x)  will  be  equal,  when  —  1  <  x  <  1,  to  the 
series  obtained  by  integrating  each  term  of  the  series  1  —  t  +  t2  —  . . .  from 
t  —  0  to  t  —  x,  i.e.  to  the  series  x  —  \x2  +  |x3  —  . . . .  And  this  is  in  fact 
the  case.  For 

1/(1  +  t)  =  1  -  t  +  t2 - +  (-1  + 

and  so,  if  x  >  —  1, 

PX  nr .2  rpTIT 

log(l  +  x)=  — -  =  x  --  +  ■■■  +  (-1)™-1—  +  (-1  TRm, 

JQ  1  l  Z  TYl 


where 


Rn 


tm  dt 
1  +  t 


See  Appendix  II  for  some  further  remarks  on  this  subject. 
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We  require  to  show  that  the  limit  of  Rrn,  when  m  tends  to  oo,  is  zero. 
This  is  almost  obvious  when  0  <  x  5=  1;  for  then  Rm  is  positive  and  less 
than 


tmdt 

and  therefore  less  than  l/(m  +  1). 
put  t  =  — u  and  x  =  — £,  so  that 


^.m+l 

m  +  1  ’ 

If  on  the  other  hand  —  1  <  x  <  0, 


we 


Rm  =  (-l)r 


p*  Um  du 


1 


u 


which  shows  that  Rm  has  the  sign  of  (— l)m.  Also,  since  the  greatest  value 
of  1/(1  —  it)  in  the  range  of  integration  is  1/(1  —  £),  we  have 


0  <  \RJ  < 


um  du  = 


e 


< 


(m +  !)(!-£)  (m  +  1)(1  —  £) 


and  so  Rm  — >•  0. 

Hence 

log(l  +  x)  =  x  —  \x 2  +  ^x3  —  •  •  •  , 

provided  that  —  1  <  x  ^  1.  Ifx  lies  outside  these  limits  the  series  is  not 
convergent.  If  x  —  1  we  obtain 


log  2  =  1  -  |  +  |  *  ... , 


a  result  already  proved  otherwise  (Ex.  lxxxix.  7). 


214.  The  series  for  the  inverse  tangent.  It  is  easy  to  prove  in  a 
similar  manner  that 

rx  dt  rx 

arc  tan  x  —  I  - -  =  /  (1  —  t2  +  t4  —  . . . )  dt 

Jo  1  +  t  Jo 

-  rp  _  rp  ^  I  rp  _ 

—  2  it  i  ^  it  •  •  •  ^ 

provided  that  —  1  ^  x  ^  1.  The  only  difference  is  that  the  proof  is  a  little 
simpler;  for,  since  arc  tan  x  is  an  odd  function  of  x,  we  need  only  consider 
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positive  values  of  x.  And  the  series  is  convergent  when  x  =  —  1  as  well  as 
when  x  —  1.  We  leave  the  discussion  to  the  reader.  The  value  of  arc  tan  x 
which  is  represented  by  the  series  is  of  course  that  which  lies  between  — 
and  t  when  —  1  A  x  A  1,  and  which  we  saw  in  Ch.  VII  (Ex.  lxiii.  3)  to 
be  the  value  represented  by  the  integral.  If  x  =  1,  we  obtain  the  formula 


1 


,  =  X  +  \x2  +  77  X3  + 

1  —  X  1  Z  6 


Examples  XCI.  1.  log 

2.  argtanhx  =  |  log  ^  |  =  x  +  |x3  +  |x5  +  . . .  if  —  1 

3.  Prove  that  if  x  is  positive  then 


if  —  1  ^  x  <  1. 
— 1  <  x  <  1. 


log(l  +x)  =  — —  +  2 

1  +  X 


X 

1  +  X 


+ 


X 

1  +  X 


+  ... 


(Math.  Trip.  1911.) 

4.  Obtain  the  series  for  log(l  +  x)  and  arc  tan  x  by  means  of  Taylor’s  the¬ 
orem. 

[A  difficulty  presents  itself  in  the  discussion  of  the  remainder  in  the  first 
series  when  x  is  negative,  if  Lagrange’s  form  Rn  =  (— \)n~l xn / {n(l  +  9x)n}  is 
used;  Cauchy’s  form,  viz. 


Rn  =  (-l)n_1(l  -  0)n-lxn/(  1  +  9x)n , 


should  be  used  (cf.  the  corresponding  discussion  for  the  Binomial  Series, 
Ex.  LVI.  2  and  §  163). 

In  the  case  of  the  second  series  we  have 


D™  arc  tan  x  =  D™  1  { 1/(1  +  x2)} 

=  (— 1  )n~l(n  —  l)!(x2  +  l)_ri/2  sin{narctan(l/x)} 

(Ex.  xlv.  11),  and  there  is  no  difficulty  about  the  remainder,  which  is  obviously 
not  greater  in  absolute  value  than  1/n.*] 


*The  formula  for  D"  arctanx  fails  when  x  =  0,  as  arctan(l/x)  is  then  undefined.  It 
is  easy  to  see  (cf.  Ex.  XLV.  11)  that  arctan(l/x)  must  then  be  interpreted  as  meaning 
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5.  If  y  >  0  then 
logy  =  2 


[Use  the  identity  y  =  1  + 


y- i  i  (y-i 
y  +  13  \y  +  l 

y-  i 


+  I  (y~i 


5  \y  +  l 


+  ... 


y  + 1 


l  - 


y-  1 
y  + 1 


.  This  series  may  be  used 


to  calculate  log  2,  a  purpose  for  which  the  series  1  —  5  +  3  —  . .. ,  owing  to  the 
slowness  of  its  convergence,  is  practically  useless.  Put  y  =  2  and  find  log  2  to 
3  places  of  decimals.] 

6.  Find  log  10  to  3  places  of  decimals  from  the  formula 


log  10  =  3  log  2  +  log(l  +  4). 


7.  Prove  that 
x  +  1 


log 


X 


=  2 


+ 


+ 


2x  +  l  3(2x  +  l)3  5(2x  +  l)5 


+  ... 


if  x  >  0,  and  that 
(x  —  l)2(.x  +  2) 


log 


(x  +  l)2(x  —  2) 


=  2 


2  1 
*3-3*  +  3 


xs  —  3x 


1 

+  5 


x6  —  3x 


+  ... 


if  x  >  2.  Given  that  log  2  =  .693  147 1 . . .  and  log  3  =  1.098  612  3  . . . ,  show,  by 
putting  x  =  10  in  the  second  formula,  that  log  11  =  2.397  895 .... 

(Math.  Trip.  1912.) 

8.  Show  that  if  log  2,  log  5,  and  log  11  are  known,  then  the  formula 


log  13  =  3  log  11  +  log  5  —  9  log  2 

gives  log  13  with  an  error  practically  equal  to  .000  15. 

9.  Show  that 


(Math.  Trip.  1910.) 


\  log  2  =  7a  +  56  +  3c,  \  log  3  =  11a  +  86  +  5c,  \  log  5  =  16a  +  126  +  7c, 

where  a  =  argtanh(l/31),  6  =  argtanh(l/49),  c  =  argtanh(l/161). 

[These  formulae  enable  us  to  find  log  2,  log  3,  and  log  5  rapidly  and  with  any 
degree  of  accuracy.] 
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10.  Show  that 


j7r  =  arctan(l/2)  +  arctan(l/3)  =  4arctan(l/5)  —  arc  tan (1/239), 

and  calculate  n  to  6  places  of  decimals. 

11.  Show  that  the  expansion  of  (1  +  x)1+x  in  powers  of  x  begins  with  the 

terms  1  +  x  +  x2  +  l/2x3.  (Math.  Trip.  1910.) 

12.  Show  that 

log10  e  -  y/x(x  +  l)  log10 

approximately,  for  large  values  of  x.  Apply  the  formula,  when  x  =  10,  to  obtain 
an  approximate  value  of  log10  e,  and  estimate  the  accuracy  of  the  result. 

(Math.  Trip.  1910.) 

13.  Show  that 

T^x log  (l^x)  =  x  +  (■*■ +  i) x2  +  (x  +  \  +  s)  +  •  • '  ’ 


if  -1  <  x  <  1.  [Use  Ex.  LXXXI.  2.] 

14.  Using  the  logarithmic  series  and  the  facts  that  log10  2.3758  =  .375  809  9  . . . 

and  log10  e  =  .4343 . . . ,  show  that  an  approximate  solution  of  the  equation 
x  =  100  log10  x  is  237.58121.  (Math.  Trip.  1910.) 

15.  Expand  logcosx  and  log(sinx/x)  in  powers  of  x  as  far  as  x4,  and  verify 
that,  to  this  order, 


log  sin  x  =  log  x  —  log  cos  x  +  log  cos  \x. 


45 


(Math.  Trip.  1908.) 


16.  Show  that 


dt 


Jo  1  + 

if  —  1  <  x  <  1.  Deduce  that 


-  /y>  _  1  ,  19  _ 

^  •  9*^  •  •  • 


1  —  g  +  g  —  •••  —  {tt  T  2  log(\/2  +  1)}/  4V2. 


(Math.  Trip.  1896.) 
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[Proceed  as  in  §  214  and  use  the  result  of  Ex.  XLVlli.  7.] 

17.  Prove  similarly  that 

/'  I  +2  r]f 

g  —  7  +  TT - =  J  Y  _j_  ^4  =  I71-  “  ^  log(\/2  +  l)}/4\/2. 

18.  Prove  generally  that  if  a  and  b  are  positive  integers  then 

11  1  f1  t a~4  dt 

a  a  +  b  +  a  +  2b  J0  1  +  tb  ' 

and  so  that  the  sum  of  the  series  can  be  found.  Calculate  in  this  way  the  sums 
of  1  —  \  +  ^  —  . . .  and  \  -  \  |  — - 

215.  The  Binomial  Series.  We  have  already  (§  163)  investigated 
the  Binomial  Theorem 

(i+«r  =  i  +  (7)«+(™y+..., 

assuming  that  —1  <  x  <  1  and  that  m  is  rational.  When  m  is  irrational 
we  have 


(1  +  x)m  =  emlog(1+a:), 

Dx(  1  +  x)m  =  {m/ (1  +  x)}emlog{1+x)  =  m{  1  +  x)m~\ 

so  that  the  rule  for  the  differentiation  of  (l+a:)m  remains  the  same,  and  the 
proof  of  the  theorem  given  in  §  163  retains  its  validity.  We  shall  not  discuss 
the  question  of  the  convergence  of  the  series  when  x  =  1  or  x  =  —1.* 


Examples  XCII.  1.  Prove  that  if  —  1  <  x  <  1  then 


Vl  +  x2 


=  1  —  - x 1  + 


1  •  3 
2^4 


x4  — 


Vl  —  x2 


=  1  +  -X  + 


1  •  3 
2^4 


x*  +  .. 


*See  Bromwich,  Infinite  Series ,  pp.  150  et  seq.;  Hobson,  Plane  Trigonometry 
(3rd  edition),  p.  271. 
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2.  Approximation  to  quadratic  and  other  surds.  Let  \[M  be  a 

quadratic  surd  whose  numerical  value  is  required.  Let  N2  be  the  square 
nearest  to  M;  and  let  M  =  N2  +  x  or  M  =  N 2  —  x,  x  being  positive.  Since 
x  cannot  be  greater  than  N ,  x/N2  is  comparatively  small  and  the  surd 
VM  =  iv  v T±  (x/N2)  can  be  expressed  in  a  series 


=  N 


1  •  1 
2^4 


which  is  at  any  rate  fairly  rapidly  convergent,  and  may  be  very  rapidly  so.  Thus 


V&7  =  v/64  +  3 


1  •  1 
2^4 


Let  us  consider  the  error  committed  in  taking  8-[|  (the  value  given  by  the  first 
two  terms)  as  an  approximate  value.  After  the  second  term  the  terms  alternate 
in  sign  and  decrease.  Hence  the  error  is  one  of  excess,  and  is  less  than  32/642, 
which  is  less  than  .003. 

3.  If  X  is  small  compared  with  N2  then 

-  x  Nx 

+  X  =  N+4N  +  2(2N2  +  Xy 

the  error  being  of  the  order  x4/IV7.  Apply  the  process  to  \/907. 

[Expanding  by  the  binomial  theorem,  we  have 

2  S 

-  X  X  X 

+  X  =  N  +  2N  ~  8iV3  +  16iV5’ 

the  error  being  less  than  the  numerical  value  of  the  next  term,  viz.  5x4/128IV7. 
Also 

Nx  X  /  X  \-!  X  x2  X 3 

2(2 N2  +  x)  ~  W  V  +  2N2)  ~  W  ~  8 W  +  16 N5  ’ 

the  error  being  less  than  x4/32Ar'.  The  result  follows.  The  same  method  may 
be  applied  to  surds  other  than  quadratic  surds,  e.g.  to  v^1031.] 

4.  If  M  differs  from  N 3  by  less  than  1  per  cent,  of  either  then  differs 

from  | IV  +  |( M/N 2)  by  less  than  IV/90,000.  (Math.  Trip.  1882.) 
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5.  If  M  =  N4  +  x,  and  x  is  small  compared  with  N,  then  a  good  approxi¬ 
mation  for  \/~M  is 

51  5  M  27 Nx 

— N  - 1 - 

56  56  N3  14(7M  +  5A4) ' 

Show  that  when  N  =  10,  x  =  1,  this  approximation  is  accurate  to  16  places  of 
decimals.  {Math.  Trip.  1886.) 

6.  Show  how  to  sum  the  series 


where  Pr(n )  is  a  polynomial  of  degree  r  in  n. 

[Express  Pr(n )  in  the  form  Aq  +  A\n  +  A^nfji  —  1)  +  . . .  as  in  Ex.  xc.  7.] 

oo  /  txd  /  jYb\ 

7.  Sum  the  series  Y  n  (  )xn ,  Y  n2  (  )xn  and  prove  that 

o  W  o  K'n-J 


OO 


X 


n 


o 


{m3x3  +  m(3m  —  l)x2  +  mx}{  1  +  x)m  3. 


216.  An  alternative  method  of  development  of  the  theory 
of  the  exponential  and  logarithmic  functions.  We  shall  now  give 
an  outline  of  a  method  of  investigation  of  the  properties  of  ex  and  log  x  entirely 
different  in  logical  order  from  that  followed  in  the  preceding  pages.  This  method 

x2 

starts  from  the  exponential  series  1  +  x  +  —  +  ... .  We  know  that  this  series  is 
convergent  for  all  values  of  x,  and  we  may  therefore  define  the  function  exp.r  by 
the  equation 

x2  .  . 

expx  =  1  +  x  +  —  + -  (1) 

We  then  prove,  as  in  Ex.  Lxxxi.  7,  that 

expx  X  expy  =  exp(x  +  y).  (2) 


Again 


exp  h  —  1  „  h  h 2  _  , , . 

— -  =  1  +  2!  +  3!+“  + 


h 
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where  p(h)  is  numerically  less  than 


|  \h\  +  \\h\~  +  \\h\^  +  •  •  •  —  |^/i|/(l  —  | \h\), 
so  that  p(h )  — >  0  as  h  — >  0.  And  so 


exp(x  +  h)  —  exp  x 

h 


=  exp  x 


exp  h—  1 

h 


expx 


as  h  — >  0,  or 


Dx  exp  x  =  exp  x. 


(3) 


Incidentally  we  have  proved  that  exp  x  is  a  continuous  function. 

We  have  now  a  choice  of  procedure.  Writing  y  =  expx  and  observing  that 
expO  =  1,  we  have 

dy  f  y  dt 

Tx=y ’  X  =  J1  7’ 

and,  if  we  define  the  logarithmic  function  as  the  function  inverse  to  the  expo¬ 
nential  function,  we  are  brought  back  to  the  point  of  view  adopted  earlier  in  this 
chapter. 

But  we  may  proceed  differently.  From  (2)  it  follows  that  if  n  is  a  positive 
integer  then 

(exp.x)n  =  expnx,  (exp  l)n  =  expn. 

If  x  is  a  positive  rational  fraction  m/n,  then 


{exp(?n/n)}n  =  expm  =  (expl)m, 

and  so  exp  (m/n)  is  equal  to  the  positive  value  of  (expl)m/n.  This  result  may 
be  extended  to  negative  rational  values  of  x  by  means  of  the  equation 


expxexp(— x)  =  1; 


and  so  we  have 


expx  =  (exp  l)x  =  ex, 


say,  where 

e  =  exp  1  =  1  +  1  +  ^  +  ^  +  ..., 

for  all  rational  values  of  x.  Finally  we  define  ex,  when  x  is  irrational,  as  being 
equal  to  expx.  The  logarithm  is  then  defined  as  the  function  inverse  to  expx 
or  ex. 
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Example.  Develop  the  theory  of  the  binomial  series 

1  +  (T)  X  +  (*2  )  x2  +  ■■■  =  f(m > 

where  —  1  <  x  <  1,  in  a  similar  manner,  starting  from  the  equation 

f(m,  x)f(m!,  x )  =  f(m  +  m' ,  x) 


(Ex.  LXXXI.  6). 

MISCELLANEOUS  EXAMPLES  ON  CHAPTER  IX* 

1.  Given  that  log10  e  =  .4343  and  that  210  and  321  are  nearly  equal  to 
powers  of  10,  calculate  log10  2  and  log10  3  to  four  places  of  decimals. 

(Math.  Trip.  1905.) 

2.  Determine  which  of  (^e)^  and  (v/2)5,r  is  the  greater.  [Take  logarithms 
and  observe  that  \/3/(\/3  +  j7r)  <  |\/3  <  .6929  <  log  2.] 

3.  Show  that  log10  n  cannot  be  a  rational  number  if  n  is  any  positive 
integer  not  a  power  of  10.  [If  n  is  not  divisible  by  10,  and  log10n  =  p/q ,  we 
have  10p  =  nq,  which  is  impossible,  since  10p  ends  with  0  and  nq  does  not.  If 
n  =  10“ N,  where  N  is  not  divisible  by  10,  then  log10  N  and  therefore 

logio  n  =  a  +  log10  N 


cannot  be  rational.] 

4.  For  what  values  of  x  are  the  functions  logx,  log  log  x,  log  log  log  x,  ... 
(a)  equal  to  0  ( b )  equal  to  1  (c)  not  defined?  Consider  also  the  same  question 
for  the  functions  lx,  llx,  lllx,  . . . ,  where  lx  =  log  |x|. 

5.  Show  that 

logx  -  log  (a;  +  1)  +  (^j  log(x  +  2) - f  (-1)"  log(x  +  n) 

is  negative  and  increases  steadily  towards  0  as  x  increases  from  0  towards  oo. 

*  A  considerable  number  of  these  examples  are  taken  from  Bromwich’s  Infinite  Series. 
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[The  derivative  of  the  function  is 


yv_i)r(v)— = _ - _ , 

\rj  x  +  r  x(x  +  1) . . .  (x  +  n) 


as  is  easily  seen  by  splitting  up  the  right-hand  side  into  partial  fractions.  This 
expression  is  positive,  and  the  function  itself  tends  to  zero  as  x  — >  oo,  since 


log(x  +  r)  =  logx  +  ex, 


where  ex  — >  0,  and  1 
6.  Prove  that 


0.] 


d_ 

dx 


logx  (— l)nra! 


rn+ 1 


log  x  -  1  -  - 


(Math.  Trip.  1909.) 

7.  If  x  >  —1  then  x2  >  (1  +  ,x){log(l  +  x)}2.  (Math.  Trip.  1906.) 

[Put  1  +  x  =  e^,  and  use  the  fact  that  sinh£  >  £  when  £  >  0.] 

8.  Show  that  {log(l+.T)}/x  and  x/{(l+x)  log(l+x)}  both  decrease  steadily 
as  x  increases  from  0  towards  oo. 

9.  Show  that,  as  x  increases  from  —1  towards  oo,  the  function  (1  +  x )~1/x 
assumes  once  and  only  once  every  value  between  0  and  1.  (Math.  Trip.  1910.) 

10.  Show  that  - — ; - - - >  -  as  x  — >  0. 

log(l  +  x)  x  2 

11.  Show  that  - - - decreases  steadily  from  1  to  0  as  x  increases 

log(l  +  x)  x 

from  —1  towards  oo.  [The  function  is  undefined  when  x  =  0,  but  if  we  attribute 
to  it  the  value  \  when  x  =  0  it  becomes  continuous  for  x  =  0.  Use  Ex.  7  to  show 
that  the  derivative  is  negative.] 

12.  Show  that  the  function  (log£  —  logx)/(£  —  x),  where  £  is  positive,  de¬ 
creases  steadily  as  x  increases  from  0  to  £,  and  find  its  limit  as  x  — >  £. 

13.  Show  that  ex  >  MxN ,  where  M  and  N  are  large  positive  numbers,  if 
x  is  greater  than  the  greater  of  21ogM  and  16N2. 

[It  is  easy  to  prove  that  log  x  <  2y/x;  and  so  the  inequality  given  is  certainly 
satisfied  if 


x  >  log  M  +  2N  y/x, 


[IX  :  216]  THE  LOGARITHMIC  AND  EXPONENTIAL  FUNCTIONS 


486 


and  therefore  certainly  satisfied  if  \ x  >  log  M,  4 x  >  2N  y/x .} 

14.  If  f(x)  and  4>{x)  tend  to  infinity  as  x  — >  oo,  and  f'(x)/<f/(x )  — >  oo,  then 
f(x)/ 4>(x)  — >  oo.  [Use  the  result  of  Ch.  VI,  Misc.  Ex.  33.]  By  taking  f(x)  =  xa, 
(j>(x)  =  logx,  prove  that  (logx)/x“  — >  0  for  all  positive  values  of  a. 

15.  If  p  and  q  are  positive  integers  then 


1 


+ 


1 


pn  +  1  pn  +  2 


+ 


+  —  -»  log  4 


qn 


V 


as  n  — >  oo.  [Cf.  Ex.  lxxviii.  6.] 

16.  Prove  that  if  x  is  positive  then  nlog{^(l  +  x1/Tl)}  — >  —  \  logx  as  n  — >  oo. 
[We  have 

nlog{|(l  +  a;1/")}  =  nlog{l  —  \(l  —  x 1^n)}  =  \n{  1  —  — — 

where  u  =  |(1  —  x1//n).  Now  use  §  209  and  Ex.  lxxxii.  4.] 

17.  Prove  that  if  a  and  b  are  positive  then 

+  bl/n)}n  Vd). 

[Take  logarithms  and  use  Ex.  16.] 

18.  Show  that 


1  1  1 
1  +  -  +  -  +  --  +  ^-T 


=  \  log  n  +  log  2  +  ^7  +  en, 


where  7  is  Euler’s  constant  (Ex.  LXXXIX.  1)  and  en  — >  0  as  n  — >  00. 

19.  Show  that 

1  '  3  2^5^7  —  2iU&Z’ 

the  series  being  formed  from  the  series  1  —  ^  |  —  . . .  by  taking  alternately  two 

positive  terms  and  then  one  negative.  [The  sum  of  the  first  3n  terms  is 


,  11  ll 

1  +  x  +  7  +  ' ' '  +  - 1 —  x 

3  5  4n  -  1  2 


1  +  2  + 


+ 


n 


=  I  log 2n  +  log 2  +  +  en  —  b(logn  +  7  +  e'n), 
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where  en  and  e'n  tend  to  0  as  n  — >  oo.  (Cf.  Ex.  lxxviii.  6).] 

20.  Show  that  1  —  \  —  jj  +  g  —  g  —  |  +  g  —  jg  —  "  '  =  \  l°g  2. 

21.  Prove  that 

n  1 

E  is(36v2  -  1)  =  "3  +  3Ssn+1  ~^n~Sn 


where  Sn  =  1  4 - |-  •  •  •  4 — ,  Sn  =  1  4 - [-•••4 - .  Hence  prove  that  the 

2  n  3  2n  —  1 

sum  of  the  series  when  continued  to  infinity  is 

-3  +  |  log  3  +  2  log  2. 


22.  Show  that 


(Math.  Trip.  1905.) 


OO  ^  OO  1 

n(An2  —  1)  n(9 n2  —  1) 


|  (log  3  —  1). 


23.  Prove  that  the  sums  of  the  four  series 


E 

i 


i 

4n2  —  1  ’ 


4n2  —  1  ’ 


X 

l 


1 

(2n  +  l)2  -  1' 


X 


(2n  +  l)2  -  1 


are  b  ~  b  b  l  l°g  2  -  |  respectively. 

24.  Prove  that  n!  ( a/n)n  tends  to  0  or  to  oo  according  as  a  <  e  or  a  >  e. 

[If  un  =  n\  ( a/n)n  then  un+i/un  =  a{l  +  (1  /n)}~n  — >  a/e.  It  can  be  shown 

that  the  function  tends  to  oo  when  a  =  e:  for  a  proof,  which  is  rather  be¬ 
yond  the  scope  of  the  theorems  of  this  chapter,  see  Bromwich’s  Infinite  Series , 
pp.  461  et  seq .] 

25.  Find  the  limit  as  x  — >  oo  of 


/  ao  +  aix  +  •  •  •  +  arxr  \  Ao+Al3: 
\bo  +  b\x  4 - 1-  brxr  ) 


distinguishing  the  different  cases  which  may  arise. 


(Math.  Trip.  1886.) 
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26.  Prove  that 

£iog(i  +  £)  (x>°) 

diverges  to  oo.  [Compare  with  JJ(x/n).]  Deduce  that  if  x  is  positive  then 

(1  +  x)(2  +  x) . . .  (n  +  x)/n\  — >  oo 

^  /  x\ 

as  n  — >  oo.  [The  logarithm  of  the  function  is  )T)  log  I  1  H —  1 .] 

V  V  / 

27.  Prove  that  if  x  >  —  1  then 


1 


1 


+ 


1! 


(x  +  l)2  (x  +  l)(.x  +  2)  (x  +  l)(x  +  2)(x  +  3) 

2! 

+ 


(■ x  +  1)  (x  +  2)(x  +  3)(x  +  4) 


+  .... 


(Math.  Trip.  1908.) 

[The  difference  between  l/(x  +  l)2  and  the  sum  of  the  first  n  terms  of  the 
series  is 


1 


n\ 


(x  +  l)2  (x  +  2)(x  +  3) . . .  (x  +  n  +  1)  ’ 

28.  No  equation  of  the  type 

Aeax  +  Bepx  +  •  •  •  =  0, 

where  A,  B,  ...  are  polynomials  and  a,  f3,  .. .  different  real  numbers,  can  hold 
for  all  values  of  x.  [If  a  is  the  algebraically  greatest  of  a,  /3,  . . . ,  then  the 
term  Aeax  outweighs  all  the  rest  as  x  — >  oo.] 

29.  Show  that  the  sequence 


ai  =  e,  a2  =  ee  ,  a3  =  ee  ,  . . . 

tends  to  infinity  more  rapidly  than  any  member  of  the  exponential  scale. 

[Let  e\(x)  =  ex,  e2(x)  =  eei^x\  and  so  on.  Then,  if  ek(x)  is  any  member  of 
the  exponential  scale,  an  >  ek(n)  when  n  >  k.\ 

30.  Prove  that 
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where  a  is  to  be  put  equal  to  tp(x)  and  [5  to  (j)(x)  after  differentiation.  Establish 
a  similar  rule  for  the  differentiation  of 

31.  Prove  that  if  D™e~x2  =  e~x2 4>n(x)  then  (i)  (f>n(x )  is  a  polynomial  of 
degree  n,  (ii)  (j)n+i  =  —2 x<f>n  +  (f>'n ,  and  (iii)  all  the  roots  of  <j>n  =  0  are  real  and 
distinct,  and  separated  by  those  of  4>n-\  =  0.  [To  prove  (iii)  assume  the  truth 
of  the  result  for  k  =  1,  2,  . . . ,  n,  and  consider  the  signs  of  4>n+i  for  the  n  values 
of  x  for  which  <pn  =  0  and  for  large  (positive  or  negative)  values  of  x.] 

32.  The  general  solution  of  f(xy)  =  f(x)f(y),  where  /  is  a  differentiable 
function,  is  x“,  where  a  is  a  constant:  and  that  of 


f{x  +  y)  +  f{x  -y)  =  2  f{x)f(y) 

is  cosh  ax  or  cosax,  according  as  f"(  0)  is  positive  or  negative.  [In  proving  the 
second  result  assume  that  /  has  derivatives  of  the  first  three  orders.  Then 

2/0*0  +  y2{f"{x)  +  ty)  =  2/(*)[/(0)  +  yf'(  0)  +  \y2{f"{  0)  +  <}], 

where  ey  and  e'y  tend  to  zero  with  y.  It  follows  that  /( 0)  =  1,  /'( 0)  =  0, 
f"(x)  =  /"(0)/(x),  so  that  a  =  a//"( 0)  or  a  =  0).] 

33.  How  do  the  functions  Xsm"(i/x) ^  ^cosecli/x)  ^e]-iave  as  x  — >•  +0? 

34.  Trace  the  curves  y  =  tanxetana:,  y  =  sin  x  log  tan  !2x. 

35.  The  equation  ex  =  ax  +  b  has  one  real  root  if  a  <  0  or  a  =  0,  b  >  0. 
If  a  >  0  then  it  has  two  real  roots  or  none,  according  as  a  log  a  >  b  —  a  or 
a  log  a  <  b  —  a. 

36.  Show  by  graphical  considerations  that  the  equation  ex  =  ax2  +  2 bx  +  c 
has  one,  two,  or  three  real  roots  if  a  >  0,  none,  one,  or  two  if  a  <  0;  and  show 
how  to  distinguish  between  the  different  cases. 

showing  that  the  point  (0,  ],)  is  a 


1 


37.  Trace  the  curve  y  =  —  log 

x 


ex  -  1 


x 


centre  of  symmetry,  and  that  as  x  increases  through  all  real  values,  y  steadily 
increases  from  0  to  1.  Deduce  that  the  equation 

sx  —  1' 


-log 

x 


=  a 


has  no  real  root  unless  0  <  a  <  1,  and  then  one,  whose  sign  is  the  same  as  that 
of  a  —  \ .  [In  the  first  place 


y  ~  5  =  -  i  los 


e*  -  1 


—  log 


=  -  log 

x 


(sinh  lx 
— . 
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is  clearly  an  odd  function  of  x.  Also 


%  =  b\  ^CQthix-i-iog 


(sinh  4 x 
— . 


The  function  inside  the  large  bracket  tends  to  zero  as  x  — >  0;  and  its  derivative 

is 


which  has  the  sign  of  x.  Hence  dy/dx  >  0  for  all  values  of  x] 

38.  Trace  the  curve  y  =  el^x\Jx2  +  2x ,  and  show  that  the  equation 


e1^  y/  x2  +  2x  = 


a 


has  no  real  roots  if  a  is  negative,  one  negative  root  if 


0  <  a  <  a  =  e1^ 


2  +  2^2, 


and  two  positive  roots  and  one  negative  if  a  >  a. 

x2  xn 

39.  Show  that  the  equation  fn(x)  =  1  +  x  -\ - -  +  •  •  •  H - -  =  0  has  one  real 

2!  n\ 

root  if  n  is  odd  and  none  if  n  is  even. 

[Assume  this  proved  for  n  =  1,  2,  . . .  2k.  Then  f2k+i(x )  =  0  has  at  least 
one  real  root,  since  its  degree  is  odd,  and  it  cannot  have  more  since,  if  it  had, 
f'2k+i(x)  or  f'2k(x)  would  have  to  vanish  once  at  least.  Hence  f-2k+i{x)  =  0  has 
just  one  root,  and  so  f-2k+2(x)  =  0  cannot  have  more  than  two.  If  it  has  two,  say 
a  and  /?,  then  /2fc+2(*T )  or  f2k+i(x)  must  vanish  once  at  least  between  a  and  (3, 
say  at  7.  And 

72fc+2 

f2k+2 (t )  =  /2fc+i(7 )  +  —  ^yy  >  0. 

But  f2k+2(x)  is  also  positive  when  x  is  large  (positively  or  negatively),  and 
a  glance  at  a  figure  will  show  that  these  results  are  contradictory.  Hence 
f2k+2(x)  =  0  has  no  real  roots.] 

40.  Prove  that  if  a  and  b  are  positive  and  nearly  equal  then 


1 


loSi  =  :  +  t 


1  1 
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approximately,  the  error  being  about  |{(a  —  6)/a}3.  [Use  the  logarithmic  series. 
This  formula  is  interesting  historically  as  having  been  employed  by  Napier  for 
the  numerical  calculation  of  logarithms.] 

41.  Prove  by  multiplication  of  series  that  if  —  1  <  x  <  1  then 

|{l°g(l  +  a:)}2  =  h2  ~  K1  +  \)x?>  +  K1  +  l  +  D®4  “  •  •  •  > 

2  (arc  tan  x)2  =  \x2  -  \{1  +  \)xA  +  g(l  +  g  +  \)x&  - - 


42.  Prove  that 


(1  +  ax)1^  =  e“{l  —  \a2x  +  21(8  +  3a)a3x2(l  +  ex)}, 


where  ex  — >  0  with  x. 

43.  The  first  n  +  2  terms  in  the  expansion  of  log 
in  powers  of  x  are 


1+x  +  lr + 


xn+1  r  i 

n!  \  n  +  1 


+ 


l!  (n  +  2)  2!(n  +  3) 


—  •  •  •  +  (— l)r 


n\  (2 n  +  1) 


44.  Show  that  the  expansion  of 


{Math.  Trip.  1899.) 


x  x 

exp  |  —x - •  •  - - 

1  2  n 


in  powers  of  x  begins  with  the  terms 
1  —  x  + 


„.n+ 1  n 

X  v  > 


X 


,n+s+l 


n 


+  1  (n  +  s){n  +  s  +  1)  ’ 


45.  Show  that  if  —  1  <  x  <  1  then 


{Math.  Trip.  1909.) 


1 

3X  + 


1 

3X  + 


1  •  4 
3H3 
1  •  4 
3H3 


2V 

23x2 


+ 

+ 


1-4-7 

3-6-9 

1-4-7 

3-6-9 


32x3  +  . . . 
33x3  +  . . . 


x{x  +  3) 

9(1  —  x)7/3  ’ 
x(x2  +  18x  +  9) 
27(1  -  x)10/3 
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[Use  the  method  of  Ex.  xcn.  6.  The  results  are  more  easily  obtained  by 
differentiation;  but  the  problem  of  the  differentiation  of  an  infinite  series  is 
beyond  our  range.] 

46.  Prove  that 


dx 

(x  +  a)(x  +  b) 
dx 

(x  +  a){x  +  b )2 
x  dx 

(x  +  a){x  +  b)2 


dx 


(x  +  a)(x2  +  b2) 


x  dx 

(x  +  a)(x2  +  b2) 


— L^{0-6-Mog(5)}, 

(^{olo6(s) 

+  (?)}, 


provided  that  a  and  b  are  positive.  Deduce,  and  verify  independently,  that  each 
of  the  functions 


a— 1  — logo,  a  log  a  —  a  +  1,  ^7ro  —  log  a,  ^n  +  aloga 


is  positive  for  all  positive  values  of  a. 

47.  Prove  that  if  a,  j3,  7  are  all  positive,  and  fj 2  >  cry,  then 


dx 

ax 2  +  2  (3x  +  7 


1  J  f3  +  yjfi2  -  «7 1 


while  if  a  is  positive  and  a'y  >  /3 2  the  value  of  the  integral  is 


1 

— .  arc  tan 

-  P2 


y/  a'y  —  f32 

P 


that  value  of  the  inverse  tangent  being  chosen  which  lies  between  0  and  n.  Are 
there  any  other  really  different  cases  in  which  the  integral  is  convergent? 

48.  Prove  that  if  a  >  —1  then 


dx 

(x  +  a)y/x2  —  1 


dt 


1 0  cosh  t  +  a 


du 

u 2  +  2  au  +  1  ’ 
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and  deduce  that  the  value  of  the  integral  is 


vA  - 


:  arc  tan 


1  —  a 
1  T  a 


if  —  1  <  a  <  1,  and 


1  \/fl  +  1  +  y/a  —  1 

log 


y/a2  —  1  yj  a  +  1  —  y/a  —  1  y/a 2  —  1 

if  a  >  1.  Discuss  the  case  in  which  a  =  1. 

f°°  dx 

49.  Transform  the  integral 
ways,  showing  that  its  value  is 


arg  tanh 


a  —  1 
Cl  T  1 


/  o  (x  +  a)  y/x2  +  1 


,  where  a  >  0,  in  the 


1  (i  +  1  +  +  1 

log 


Vo2  +  1  a  +  1  —  Va2  +  1  Va2  +  1 
50.  Prove  that 


2  ,  \/o2  +  1 

arg  tanh - . 

CL  ~ hi 


arc  tan  xdx  =  \tc  —  \  log  2. 


'o 


4  2 


51.  If  0  <  a  <  1,  0  <  /?  <  1,  then 
Z'1  dx 


1  1  +  yfcvB 

log 


l-i  y/(\  —  2ax  +  a2)(l  —  2^x  +  (32)  y/af3  1  —  \/a/3 


52.  Prove  that  if  a  >  b  >  0  then 
^°°  d6 


L 


71 


_OQ  a  cosh  9  +  b  sinh  6  y/a2  —  b2 


53.  Prove  that 

f1  logx 


00  log  x 

dx  =  —  I  - — : — dx, 


1  +  X2  Ji  1  +  X2 


logx 

1  +  X2 


dx  =  0, 


and  deduce  that  if  a  >  0  then 


log  X  71 

2  i - 9  (1X  =  5-l°g«- 

az  +  xz  la 


same 
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[Use  the  substitutions  x  =  1/t  and  x  = 
54.  Prove  that 


au .] 


dx  =  ira 


if  a  >  0.  [Integrate  by  parts.] 


CHAPTER  X 


THE  GENERAL  THEORY  OF  THE  LOGARITHMIC,  EXPONENTIAL, 
AND  CIRCULAR  FUNCTIONS 

217.  Functions  of  a  complex  variable.  In  Ch.  Ill  we  defined  the 
complex  variable 

z  =  x  +  iy,* 

and  we  considered  a  few  simple  properties  of  some  classes  of  expressions 
involving  z,  such  as  the  polynomial  P(z).  It  is  natural  to  describe  such 
expressions  as  functions  of  z,  and  in  fact  we  did  describe  the  quotient 
P(z)/Q(z),  where  P(z)  and  Q(z)  are  polynomials,  as  a  ‘rational  function’. 
We  have  however  given  no  general  definition  of  what  is  meant  by  a  function 
of  z. 

It  might  seem  natural  to  define  a  function  of  z  in  the  same  way  as  that 
in  which  we  defined  a  function  of  the  real  variable  x ,  i.e.  to  say  that  Z  is 
a  function  of  z  if  any  relation  subsists  between  0  and  Z  in  virtue  of  which 
a  value  or  values  of  Z  corresponds  to  some  or  all  values  of  z.  But  it  will 
be  found,  on  closer  examination,  that  this  definition  is  not  one  from  which 
any  profit  can  be  derived.  For  if  z  is  given,  so  are  x  and  y,  and  conversely: 
to  assign  a  value  of  z  is  precisely  the  same  thing  as  to  assign  a  pair  of  values 
of  x  and  y.  Thus  a  ‘function  of  z\  according  to  the  definition  suggested, 
is  precisely  the  same  thing  as  a  complex  function 

f(x,y)  +  ig(x,y), 

of  the  two  real  variables  x  and  y.  For  example 

x  —  iy,  xy,  \z\  —  \J  x2  +  y2,  amz  —  arctan(t//x) 

are  ‘functions  of  z\  The  definition,  although  perfectly  legitimate,  is  futile 
because  it  does  not  really  define  a  new  idea  at  all.  It  is  therefore  more 
convenient  to  use  the  expression  ‘function  of  the  complex  variable  Z'  in  a 

*In  this  chapter  we  shall  generally  find  it  convenient  to  write  x  +  iy  rather  than 
x  +  yi. 


495 


[X  :  219]  THE  GENERAL  THEORY  OF  THE  LOGARITHMIC, 


496 


more  restricted  sense,  or  in  other  words  to  pick  out,  from  the  general  class  of 
complex  functions  of  the  two  real  variables  x  and  y,  a  special  class  to  which 
the  expression  shall  be  restricted.  But  if  we  were  to  attempt  to  explain 
how  this  selection  is  made,  and  what  are  the  characteristic  properties  of  the 
special  class  of  functions  selected,  we  should  be  led  far  beyond  the  limits  of 
this  book.  We  shall  therefore  not  attempt  to  give  any  general  definitions, 
but  shall  confine  ourselves  entirely  to  special  functions  defined  directly. 

218.  We  have  already  defined  polynomials  in  z  (§  39),  rational  func¬ 
tions  of  z  (§  46),  and  roots  of  z  (§  47).  There  is  no  difficulty  in  extending 
to  the  complex  variable  the  definitions  of  algebraical  functions ,  explicit  and 
implicit,  which  we  gave  (§§  26-27)  in  the  case  of  the  real  variable  x.  In 
all  these  cases  we  shall  call  the  complex  number  z,  the  argument  (§  44) 
of  the  point  z,  the  argument  of  the  function  f(z)  under  consideration. 
The  question  which  will  occupy  us  in  this  chapter  is  that  of  defining  and 
determining  the  principal  properties  of  the  logarithmic,  exponential,  and 
trigonometrical  or  circular  functions  of  0.  These  functions  are  of  course 
so  far  defined  for  real  values  of  z  only,  the  logarithm  indeed  for  positive 
values  only. 

We  shall  begin  with  the  logarithmic  function.  It  is  natural  to  attempt 
to  define  it  by  means  of  some  extension  of  the  definition 


and  in  order  to  do  this  we  shall  find  it  necessary  to  consider  briefly  some 
extensions  of  the  notion  of  an  integral. 

219.  Real  and  complex  curvilinear  integrals.  Let  AB  be  an 

arc  C  of  a  curve  defined  by  the  equations 

x  =  000,  y  =  0(0, 

where  0  and  0  are  functions  of  t  with  continuous  differential  coefficients  0' 
and  0';  and  suppose  that,  as  t  varies  from  t0  to  0,  the  point  (x,y)  moves 
along  the  curve,  in  the  same  direction,  from  A  to  B. 
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Then  we  define  the  curvilinear  integral 


{g(x,  y)  dx  +  h(x,  y)  dy}, 


(1) 


ic 


where  g  and  h  are  continuous  functions  of  x  and  y,  as  being  equivalent 
to  the  ordinary  integral  obtained  by  effecting  the  formal  substitutions 
x  =  y  =  i.e.  to 


{g{(t>^W  +  dt- 


We  call  C  the  path  of  integration. 
Let  us  suppose  now  that 


z  =  x  +  iy  =  <f>(t)  + 


so  that  z  describes  the  curve  C  in  Argand’s  diagram  as  t  varies.  Further 
let  us  suppose  that 

f(z)  —  u  +  iv 


is  a  polynomial  in  z  or  rational  function  of  z. 
Then  we  define 


f(z ) dz 


rc 


as  meaning 


(u  +  iv)(dx  +  i  dy), 
which  is  itself  defined  as  meaning 


'  c 


(2) 


v  dy)  +  i  /  (vdx  +  u  dy) . 


ic 


220.  The  definition  of  Log  (.  Now  let  f  —  £  +  irj  be  any  complex 
number.  We  define  Log  the  general  logarithm  of  (,  by  the  equation 
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where  C  is  a  curve  which  starts  from  1  and  ends  at  Q  and  does  not  pass 
through  the  origin.  Thus  (Fig.  54)  the  paths  (a),  (b),  (c)  are  paths  such 
as  are  contemplated  in  the  definition.  The  value  of  Log  z  is  thus  defined 
when  the  particular  path  of  integration  has  been  chosen.  But  at  present  it 
is  not  clear  how  far  the  value  of  Log  z  resulting  from  the  definition  depends 
upon  what  path  is  chosen.  Suppose  for  example  that  (  is  real  and  positive, 
say  equal  to  £.  Then  one  possible  path  of  integration  is  the  straight  line 
from  1  to  £,  a  path  which  we  may  suppose  to  be  defined  by  the  equations 


Fig.  54. 


x  =  t,  y  =  0.  In  this  case,  and  with  this  particular  choice  of  the  path  of 
integration,  we  have 


Log£ 


J 


so  that  Log£  is  equal  to  log£,  the  logarithm  of  £  according  to  the  definition 
given  in  the  last  chapter.  Thus  one  value  at  any  rate  of  Log£,  when  £  is 
real  and  positive,  is  log£.  But  in  this  case,  as  in  the  general  case,  the  path 
of  integration  can  be  chosen  in  an  infinite  variety  of  different  ways.  There 
is  nothing  to  show  that  every  value  of  Log£  is  equal  to  log£;  and  in  point 
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of  fact  we  shall  see  that  this  is  not  the  case.  This  is  why  we  have  adopted 
the  notation  Log£,  Log£  instead  of  log£,  log£.  Log£  is  (possibly  at  any 
rate)  a  many  valued  function,  and  log  £  is  only  one  of  its  values.  And  in  the 
general  case,  so  far  as  we  can  see  at  present,  three  alternatives  are  equally 
possible,  viz.  that 

(1)  we  may  always  get  the  same  value  of  Log£,  by  whatever  path  we 

go  from  1  to  £; 

(2)  we  may  get  a  different  value  corresponding  to  every  different  path; 

(3)  we  may  get  a  number  of  different  values  each  of  which  corresponds 

to  a  whole  class  of  paths: 

and  the  truth  or  falsehood  of  any  one  of  these  alternatives  is  in  no  way 
implied  by  our  definition. 


221.  The  values  of  Log  £.  Let  us  suppose  that  the  polar  coordinates 
of  the  point  z  =  £  are  p  and  0,  so  that 

£  =  p( cos  (j)  +  i  sin  £>) . 

We  suppose  for  the  present  that  —7 r  <  0  <  7 r,  while  p  may  have  any  positive 
value.  Thus  £  may  have  any  value  other  than  zero  or  a  real  negative  value. 

The  coordinates  (x,  y)  of  any  point  on  the  path  C  are  functions  of  t, 
and  so  also  are  its  polar  coordinates  ( r,9 ).  Also 


Log  £  =  /  —  = 


'c 

rt  i 


'to 


r  dx  +  idy 
lc  x  +  iy 


X  +  27/  \  dt  dt 


in  virtue  of  the  definitions  of  §  219.  But  x  =  r  cos#,  y  =  r  sin 9,  and 

d9\ 
dt ) 


dx  dy  (  dr  /  „  dr  „  dO \ 

—  +  i—  =  (  cos  0  —  —  r  sm  0  —  I  +  i  (  sin  0  —  +  r  cos  6  —  ) 


dt  dt 


dt 


/  „  .  .  /  dr  .  d6 

=  (cos  a  +  i  sm  6)  I  — — K  ir  — 


dt 


dt 
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so  that 

/'L  1  dr  d.0 

L°gC  =  i,  rdtdt+ll0  =  [logr]+i[e], 

where  [log  r]  denotes  the  difference  between  the  values  of  log  r  at  the  points 
corresponding  to  t  =  t\  and  t  =  to,  and  [6]  has  a  similar  meaning. 

It  is  clear  that 

[log  r]  =  log  p  —  log  1  =  log  p; 

but  the  value  of  [0]  requires  a  little  more  consideration.  Let  us  suppose 
first  that  the  path  of  integration  is  the  straight  line  from  1  to  (.  The  initial 
value  of  6  is  the  amplitude  of  1,  or  rather  one  of  the  amplitudes  of  1,  viz. 


2kn,  where  k  is  any  integer.  Let  us  suppose  that  initially  6  =  2kn.  It  is 
evident  from  the  figure  that  6  increases  from  2kn  to  2kir  +  cp  as  t  moves 
along  the  line.  Thus 

[0]  =  (2kn  +  4>)  —  2/c7r  =  cf), 

and,  when  the  path  of  integration  is  a  straight  line,  LogC  =  logp  +  i<j). 

We  shall  call  this  particular  value  of  Log  Q  the  principal  value.  When 
C  is  real  and  positive,  (  =  p  and  0  =  0,  so  that  the  principal  value  of  LogC 
is  the  ordinary  logarithm  logd-  Hence  it  will  be  convenient  in  general  to 
denote  the  principal  value  of  LogC  by  logd-  Thus 

logC  =  log  p  +  i(p, 

and  the  principal  value  is  characterised  by  the  fact  that  its  imaginary  part 
lies  between  —  tt  and  tt. 
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Next  let  us  consider  any  path  (such  as  those  shown  in  Fig.  56)  such  that 
the  area  or  areas  included  between  the  path  and  the  straight  line  from  1 


to  £  does  not  include  the  origin.  It  is  easy  to  see  that  [0]  is  still  equal  to  f>. 
Along  the  curve  shown  in  the  figure  by  a  continuous  line,  for  example,  9, 
initially  equal  to  2kn,  first  decreases  to  the  value 

2kn  -  XOP 

and  then  increases  again,  being  equal  to  2kn  at  Q,  and  finally  to  2kir  +  </>. 
The  dotted  curve  shows  a  similar  but  slightly  more  complicated  case  in 
which  the  straight  line  and  the  curve  bound  two  areas,  neither  of  which 
includes  the  origin.  Thus  if  the  path  of  integration  is  such  that  the  closed 
curve  formed  by  it  and  the  line  from  1  to  £  does  not  include  the  origin, 
then 

Log  £  =  log£  =  log  p  +  if. 

On  the  other  hand  it  is  easy  to  construct  paths  of  integration  such 
that  [9]  is  not  equal  to  £>.  Consider,  for  example,  the  curve  indicated  by 
a  continuous  line  in  Fig.  57.  If  9  is  initially  equal  to  2ki\,  it  will  have 
increased  by  2i r  when  we  get  to  P  and  by  47t  when  we  get  to  Q ;  and  its 
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final  value  will  be  2kir  +  An  +  f,  so  that  [6]  =  An  +  f  and 

Log  (  =  logp  +  i(  Air  +  4>). 


In  this  case  the  path  of  integration  winds  twice  round  the  origin  in  the 
positive  sense.  If  we  had  taken  a  path  winding  k  times  round  the  origin 
we  should  have  found,  in  a  precisely  similar  way,  that  [6]  =  2kir  +  and 

Log  C  =  logp  +  i(2kir  +  </>). 

Here  k  is  positive.  By  making  the  path  wind  round  the  origin  in  the 
opposite  direction  (as  shown  in  the  dotted  path  in  Fig.  57),  we  obtain 
a  similar  series  of  values  in  which  k  is  negative.  Since  |C|  =  p,  and  the 
different  angles  2kn  +  (j)  are  the  different  values  of  am  (,  we  conclude  that 
every  value  of  log  |C|  +  *  am  (  is  a  value  of  Log£;  and  it  is  clear  from  the 
preceding  discussion  that  every  value  of  Log  (  must  be  of  this  form. 

We  may  summarise  our  conclusions  as  follows:  the  general  value  of 
Log  £  is 

log  | C|  +  i  am C  =  logp  +  i(2kn  +  0), 

where  k  is  any  positive  or  negative  integer.  The  value  of  k  is  determined 
by  the  path  of  integration  chosen.  If  this  path  is  a  straight  line  then  k  =  0 
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and 

Log  (  =  log  (  =  log  p  +  icj). 

In  what  precedes  we  have  used  (  to  denote  the  argument  of  the  func¬ 
tion  LogC,  and  (£,??)  or  (p,  0)  to  denote  the  coordinates  of  and  z,  (x,y), 
(r,  9)  to  denote  an  arbitrary  point  on  the  path  of  integration  and  its  coor¬ 
dinates.  There  is  however  no  reason  now  why  we  should  not  revert  to  the 
natural  notation  in  which  z  is  used  as  the  argument  of  the  function  Log  z, 
and  we  shall  do  this  in  the  following  examples. 

Examples  XCIII.  1.  We  supposed  above  that  —  n  <  9  <  it,  and  so 
excluded  the  case  in  which  z  is  real  and  negative.  In  this  case  the  straight 
line  from  1  to  z  passes  through  0,  and  is  therefore  not  admissible  as  a  path  of 
integration.  Both  n  and  —  n  are  values  of  amz,  and  6  is  equal  to  one  or  other 
of  them:  also  r  =  —  z.  The  values  of  Logz  are  still  the  values  of  log  |z|  +  iamz, 
viz. 

log(-z)  +  (2k  +  l)7r  i, 

where  k  is  an  integer.  The  values  log(— z)  +  ni  and  log(— z)  —  ni  correspond  to 
paths  from  1  to  z  lying  respectively  entirely  above  and  entirely  below  the  real 
axis.  Either  of  them  may  be  taken  as  the  principal  value  of  Log  z,  as  convenience 
dictates.  We  shall  choose  the  value  log(— z)  +  vr  i  corresponding  to  the  first  path. 

2.  The  real  and  imaginary  parts  of  any  value  of  Logz  are  both  continuous 
functions  of  x  and  y ,  except  for  x  =  0,  y  =  0. 

3.  The  functional  equation  satisfied  by  Logz.  The  function  Logz 
satisfies  the  equation 

Log  Z1Z2  =  Log  zi  +  Log  Z2,  (1) 

in  the  sense  that  every  value  of  either  side  of  this  equation  is  one  of  the  values 
of  the  other  side.  This  follows  at  once  by  putting 

zi  =  ri(cos  9\  +  i  sin  9 1),  Z2  =  r2(cos  9-2  +  i  sin  $2), 
and  applying  the  formula  of  p.  503.  It  is  however  not  true  that 


log  ZlZ2  =  log  zi  +  log  Z2 


(2) 
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in  all  circumstances.  If,  e.g., 

z\  =  Z2  =  |(— 1  +  iV 3)  =  cos  1 7 t  +  i  sin  |7 r, 

then  log^i  =  logZ2  =  §7ri,  and  logzi  +  logZ2  =  which  is  one  of  the  values 
of  LogziZ2,  but  not  the  principal  value.  In  fact  log  z\Z2  =  —  |7ri. 

An  equation  such  as  (1),  in  which  every  value  of  either  side  is  a  value  of  the 
other,  we  shall  call  a  complete  equation,  or  an  equation  which  is  completely  true. 

4.  The  equation  Log  zm  =  m  Log  z,  where  m  is  an  integer,  is  not  completely 
true:  every  value  of  the  right-hand  side  is  a  value  of  the  left-hand  side,  but  the 
converse  is  not  true. 

5.  The  equation  Log(l/z)  =  —  Log  z  is  completely  true.  It  is  also  true  that 
log(l/z)  =  —  logz,  except  when  z  is  real  and  negative. 

6.  The  equation 


log 


z  —  a\ 
z-b) 


log(z  —  a)  —  log(z  —  b ) 


is  true  if  z  lies  outside  the  region  bounded  by  the  line  joining  the  points  z  =  a, 
z  =  b,  and  lines  through  these  points  parallel  to  OX  and  extending  to  infinity 
in  the  negative  direction. 

7.  The  equation 


log  {hi)  = log  i1  -  “)  - log  i1 


is  true  if  z  lies  outside  the  triangle  formed  by  the  three  points  O,  a,  b. 

8.  Draw  the  graph  of  the  function  I(Logx)  of  the  real  variable  x.  [The 
graph  consists  of  the  positive  halves  of  the  lines  y  =  2kir  and  the  negative  halves 
of  the  lines  y  =  (2 k  +  l)7r.] 

9.  The  function  f(x)  of  the  real  variable  x,  defined  by 


vr/(x)  =pvr  +  (q-p)  I(logx), 


is  equal  to  p  when  x  is  positive  and  to  q  when  x  is  negative. 

10.  The  function  f(x)  defined  by 


7 r/(x)  =  piT  +  (q  —  p)  I{log(x  —  1)}  +  (r  —  q)  I(logx) 
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is  equal  to  p  when  x  >  1,  to  q  when  0  <  x  <  1,  and  to  r  when  x  <  0. 

1 1 .  For  what  values  of  z  is  (i)  log  2  (ii)  any  value  of  Log  z  ( a)  real  or  ( b)  purely 
imaginary? 

12.  If  z  =  x  +  iy  then  Log  Log  z  =  logi?  +  i(0  +  2k' n),  where 

R2  =  (logr)2  +  (9  +  2kn)2 

and  0  is  the  least  positive  angle  determined  by  the  equations 

cos©  :  sin©  :  1  ::  logr  :  6  +  2kn  :  \J (log r)2  +  (9  +  2/c7r)2. 

Plot  roughly  the  doubly  infinite  set  of  values  of  LogLog(l  +  iy/ 3),  indicating 
which  of  them  are  values  of  logLog(l  +  iy/3)  and  which  of  Loglog(l  +  iy/ 3). 

222.  The  exponential  function.  In  Ch.  IX  we  defined  a  function  ey 
of  the  real  variable  y  as  the  inverse  of  the  function  y  =  log  x.  It  is  naturally 
suggested  that  we  should  define  a  function  of  the  complex  variable  z  which 
is  the  inverse  of  the  function  Log  z. 

Definition.  If  any  value  of  Log  z  is  equal  to  (,  we  call  z  the  exponen¬ 
tial  of  (  and  write 

z  =  exp  (. 

Thus  z  =  exp£  if  C  =  Log£.  It  is  certain  that  to  any  given  value  of  z 
correspond  infinitely  many  different  values  of  (.  It  would  not  be  unnatural 
to  suppose  that,  conversely,  to  any  given  value  of  (  correspond  infinitely 
many  values  of  z,  or  in  other  words  that  exp  (  is  an  infinitely  many-valued 
function  of  (.  This  is  however  not  the  case,  as  is  proved  by  the  following 
theorem. 

Theorem.  The  exponential  function  exp£  is  a  one-valued  function 

of  C 

For  suppose  that 

z  i  =  r1(cos0i  -t-isin^x),  z2  =  r2(cos  02  +  i  sin  d2) 
are  both  values  of  exp  (.  Then 


C  =  Log  Z\  =  Log  22, 
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and  so 

logrr  +  i{9\  +  2mir)  =  logr2  +  i(92  +  2/z7t)  , 
where  m  and  n  are  integers.  This  involves 

logri  =  logr2,  9\  +  2rmr  =  62  +  2mr. 

Tims  ri  =  r 2l  and  6£  and  92  differ  by  a  multiple  of  2n.  Hence  Z\  =  z2. 

Corollary.  Iff  is  real  then  exp£  =  e^,  the  real  exponential  function 
off  defined  in  Ch.  IX. 

For  if  z  =  e^  then  logz  =  £,  i.e.  one  of  the  values  of  Log  z  is  £.  Hence 
z  =  exp  C- 

223.  The  value  of  exp  £.  Let  £  =  £  +  iij  and 

z  =  exp  C  =  r(cos  9  +  i  sin  9). 

Then 

£  +  ip  =  Logz  =  logr  +  i{6  +  2mn), 
where  m  is  an  integer.  Hence  £  =  logr,  77  =  9  +  2mn,  or 

r  =  e^,  9  =  rj  —  2mir, 

and  accordingly 

exp(£  +  irf)  =  e^(cos  r]  +  i  sin  rj). 

If  7]  =  0  then  exp£  =  e^,  as  we  have  already  inferred  in  §  222.  It  is  clear 
that  both  the  real  and  the  imaginary  parts  of  exp(£  +  irf)  are  continuous 
functions  of  £  and  77  for  all  values  of  £  and  77. 

224.  The  functional  equation  satisfied  by  exp  £.  Let  £1  =  £1+777!, 
£2  =  £2  +  b?2-  Then 

exp  £1  x  exp  £2  =  e^1  (cos  771  +  7  sin  771)  x  e^2  (cos  772  +  7  sin  772) 

=  e^1+?2{cos(77i  +  772)  +  7sin(?7i  +  772)} 

—  exp(£i  +  £2). 

The  exponential  function  therefore  satisfies  the  functional  relation 
/(£  1  +  £2)  =  /(£i)/(£2),  an  equation  which  we  have  proved  already  (§  205) 
to  be  true  for  real  values  of  £1  and  £2. 
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225.  The  general  power  a C  It  might  seem  natural,  as  exp£  = 
when  £  is  real,  to  adopt  the  same  notation  when  £  is  complex  and  to  drop 
the  notation  expC  altogether.  We  shall  not  follow  this  course  because  we 
shall  have  to  give  a  more  general  definition  of  the  meaning  of  the  symbol  eS: 
we  shall  find  then  that  e ^  represents  a  function  with  infinitely  many  values 
of  which  exp  £  is  only  one. 

We  have  already  defined  the  meaning  of  the  symbol  a ^  in  a  considerable 
variety  of  cases.  It  is  defined  in  elementary  Algebra  in  the  case  in  which 
a  is  real  and  positive  and  (  rational,  or  a  real  and  negative  and  (  a  rational 
fraction  whose  denominator  is  odd.  According  to  the  definitions  there  given 
cb  has  at  most  two  values.  In  Ch.  Ill  we  extended  our  definitions  to  cover 
the  case  in  which  a  is  any  real  or  complex  number  and  C  any  rational 
number  p/q]  and  in  Ch.  IX  we  gave  a  new  definition,  expressed  by  the 
equation 

ac  =  ecloga, 

which  applies  whenever  (  is  real  and  a  real  and  positive. 

Thus  we  have,  in  one  way  or  another,  attached  a  meaning  to  such 
expressions  as 

31/2,  (-1)1/3,  (v^T  |i)“1/2,  (3.5)1+v/2; 

but  we  have  as  yet  given  no  definitions  which  enable  us  to  attach  any 
meaning  to  such  expressions  as 

(1  +  i)^,  2\  (3  +  2  i)2+3i. 

We  shall  now  give  a  general  definition  of  gA  which  applies  to  all  values  of 
a  and  (,  real  or  complex,  with  the  one  limitation  that  a  must  not  be  equal 
to  zero. 

Definition.  The  function  cA  is  defined  by  the  equation 

<A  =  exp(<C  Log  a) 

where  Log  a  is  any  value  of  the  logarithm  of  a. 

We  must  first  satisfy  ourselves  that  this  definition  is  consistent  with  the 
previous  definitions  and  includes  them  all  as  particular  cases. 
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(1)  If  a  is  positive  and  (  real,  then  one  value  of  (  Log  a,  viz.  ( log  a,  is 
real:  and  exp(</  log  a)  =  e^loga,  which  agrees  with  the  definition  adopted  in 
Ch.  IX.  The  definition  of  Ch.  IX  is,  as  we  saw  then,  consistent  with  the 
definition  given  in  elementary  Algebra;  and  so  our  new  definition  is  so  too. 

(2)  If  a  =  eT(cos,0  +  isin-0),  then 

Log  a  =  r  +  i(-0  +  2rmr), 
exp {(p/q)  Log  a}  =  epT^q  Cis {(p/q){^>  +  2m7r)}, 

where  m  may  have  any  integral  value.  It  is  easy  to  see  that  if  m  assumes  all 
possible  integral  values  then  this  expression  assumes  q  and  only  q  different 
values,  which  are  precisely  the  values  of  ap'q  found  in  §  48.  Hence  our  new 
definition  is  also  consistent  with  that  of  Ch.  III. 

226.  The  general  value  of  aC  Let 

£  =  £  +  irj,  a  =  a(cos-?/’ +  i  sin-?/’) 

where  —  n  <  i/j  A  n,  so  that,  in  the  notation  of  §  225,  cr  =  eT  or  r  =  log  a. 

Then 


C  Log  a  =  (£  +  irj) {log  a  +  i(-0  +  2m7r)}  =  L  +  iM, 


where 


L  =  £  log  a  —  77(1/  +  2mn),  M  =  //log  a  +  £(-0  +  2rmr); 

and 

a ^  =  exp(£  Log  a)  =  eL(cos  M  +  i  sin  M). 

Thus  the  general  value  of  a ^  is 

e^iogo--?j(V’+2m7r)|-cos|^|0gcr  _|_  _|_  2m7r)}  +  isin{7/  log  a  +  £(-0  +  2m7r)}]. 

In  general  a **  is  an  inhnitely  many-valued  function.  For 

lad  =  e 


£  log  a— r)(il)+2mn) 
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has  a  different  value  for  every  value  of  m,  unless  7]  =  0.  If  on  the  other 
hand  r ]  =  0,  then  the  moduli  of  all  the  different  values  of  or  are  the  same. 
But  any  two  values  differ  unless  their  amplitudes  are  the  same  or  differ  by 
a  multiple  of  2n.  This  requires  that  £(0  +  2mn)  and  £(0  +  2ml),  where 
m  and  n  are  different  integers,  shall  differ,  if  at  all,  by  a  multiple  of  27t. 
But  if 

£(0  +  2mn)  —  £(0  +  2rm)  =  2kn, 

then  £  =  k/(m  —  n)  is  rational.  We  conclude  that  <0  is  infinitely  many- 
valued,  unless  £  is  real  and  rational.  On  the  other  hand  we  have  already 
seen  that,  when  £  is  real  and  rational,  a ^  has  but  a  finite  number  of  values. 

The  principal  value  of  a ^  =  exp(£Loga)  is  obtained  by  giving  Log  a  its 
principal  value,  i.e.  by  supposing  m  =  0  in  the  general  formula.  Thus  the 
principal  value  of  a ^  is 

e^los°— ^{cos^logcr  +  £0)  +  isin(ry  log  cr  +  £0)}. 

Two  particular  cases  are  of  especial  interest.  If  a  is  real  and  positive  and 
£  real,  then  cr  =  a,  0  =  0,  £  =  £,  y  =  0,  and  the  principal  value  of  a is  e^loga, 
which  is  the  value  defined  in  the  last  chapter.  If  |a|  =  1  and  £  is  real,  then  cr  =  1, 
£  =  £)  V  =  0,  and  the  principal  value  of  (cos0  +  i  sin  00  is  cos£0  +  isin£0. 
This  is  a  further  generalisation  of  De  Moivre’s  Theorem  (§§  45,  49). 

Examples  XCIV.  1.  Find  all  the  values  of  0.  [By  definition 

il  =  exp(i  Log  i). 

But 

i  =  cos  tj7t  +  i  sin  ^7r,  Log  i  =  (2k  +  ^)ni, 
where  k  is  any  integer.  Hence 

0  =  exp{  — (2A:  +  \)ti  }  =  e~^2k+^ . 

All  the  values  of  il  are  therefore  real  and  positive.] 

2.  Find  all  the  values  of  (1  +  ifi,  i1+l,  (1  +  i)1+l. 

3.  The  values  of  c0 ,  when  plotted  in  the  Argand  diagram,  are  the  vertices 

of  an  equiangular  polygon  inscribed  in  an  equiangular  spiral  whose  angle  is 
independent  of  a.  (Math.  Trip.  1899.) 
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[If  a =  r(cos  9  +  i  sin  6)  we  have 

r  =  e€iog<r-Tj(^+2m7r)}  e  =  r,loga  +  ^  +  2rmr)- 

and  all  the  points  lie  on  the  spiral  r  =  (j^2+r,~'>^e~r,e^ .] 

4.  The  function  e*.  If  we  write  e  for  a  in  the  general  formula,  so  that 
logo-  =  1,  ip  =  0,  we  obtain 

e ^  =  e^~2mnv {cos(r]  +  2mn £)  +  i  sin(?/  +  2rmr£>)}. 

The  principal  value  of  is  e^(cos  rj  +  i  sin  rj),  which  is  equal  to  exp£  (§  223).  In 
particular,  if  £  is  real,  so  that  rj  =  0,  we  obtain 

e^(cos  2m7r£  +  i  sin  2m7r£) 

as  the  general  and  as  the  principal  value,  denoting  here  the  positive  value 
of  the  exponential  defined  in  Ch.  IX. 

5.  Show  that  Loge^  =  (l  +  2m7ri)£  +  2n7ri,  where  m  and  n  are  any  integers, 
and  that  in  general  Loga^  has  a  double  infinity  of  values. 

6.  The  equation  1/cS-  =  aO*’  is  completely  true  (Ex.  xcm.  3):  it  is  also  true 
of  the  principal  values. 

7.  The  equation  X  =  (ab)^  is  completely  true  but  not  always  true  of 
the  principal  values. 

8.  The  equation  x  a£'  =  a^+^'  is  not  completely  true,  but  is  true  of  the 
principal  values.  [Every  value  of  the  right-hand  side  is  a  value  of  the  left-hand 
side,  but  the  general  value  of  af>  X  ,  viz. 

exp{C(logo  +  2miri)  +  ("'(logo  +  2mri)}, 

is  not  as  a  rule  a  value  of  unless  m  =  n.\ 

9.  What  are  the  corresponding  results  as  regards  the  equations 

Log  a**  =  £  Log  a,  (a^)^  =  (a^)^  =  a^'l 

10.  For  what  values  of  £  is  (a)  any  value  (6)  the  principal  value  of  (i)  real 
(ii)  purely  imaginary  (iii)  of  unit  modulus? 

11.  The  necessary  and  sufficient  conditions  that  all  the  values  of  (X  should 
be  real  are  that  2£  and  { 77  log  |a|  +  £ama}/7r,  where  am  a  denotes  any  value  of 
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the  amplitude,  should  both  be  integral.  What  are  the  corresponding  conditions 
that  all  the  values  should  be  of  unit  modulus? 

12.  The  general  value  of  \xl  +  x~l\,  where  x  >  0,  is 

g— (m— n)  7r  .y/2{cosh2(m  +  n)7r  +  cos(21ogx)}. 


13.  Explain  the  fallacy  in  the  following  argument:  since  e2mm  =  e2nm  =  1, 
where  m  and  n  are  any  integers,  therefore,  raising  each  side  to  the  power  i  we 
obtain  e~2mn  =  e~2mr. 

14.  In  what  circumstances  are  any  of  the  values  of  xx ,  where  x  is  real, 
themselves  real?  [If  x  >  0  then 

xx  =  exp(xLogx)  =  exp(xlogx)  Cis2m7rx, 

the  first  factor  being  real.  The  principal  value,  for  which  m  =  0,  is  always  real. 

If  x  is  a  rational  fraction  p/{2q  +  1),  or  is  irrational,  then  there  is  no  other 
real  value.  But  if  x  is  of  the  form  p/2q ,  then  there  is  one  other  real  value,  viz. 
—  exp(xlogx),  given  by  m  =  q. 

If  x  =  —  £  <  0  then 


xx  =  exp{— £  Log(— £)}  =  exp(— £  log£)  Cis{-(2?n  +  l)7r^}. 


The  only  case  in  which  any  value  is  real  is  that  in  which  £  =  p/(2q  +  1),  when 
m  =  q  gives  the  real  value 

exp(— £  log  £)  Cis(-pvr)  =  (-l)p£“5. 

The  cases  of  reality  are  illustrated  by  the  examples 

(U1/3  =  yi.  (§>*  =  ±4.  (-!)->  =  H)-s  =  -^.] 


15.  Logarithms  to  any  base.  We  may  define  £  =  Loga  z  in  two  different 
ways.  We  may  say  (i)  that  £  =  Loga  z  if  the  principal  value  of  a is  equal  to  z; 
or  we  may  say  (ii)  that  £  =  Loga  z  if  any  value  of  a is  equal  to  z. 

Thus  if  a  =  e  then  £  =  Loge  z,  according  to  the  first  definition,  if  the  principal 
value  of  e*’  is  equal  to  z,  or  if  exp£  =  z;  and  so  Loge  z  is  identical  with  Log;?. 
But,  according  to  the  second  definition,  £  =  Loge  z  if 

=  exp(£  Log  e)  =  z,  £  Log  e  =  Log  z, 
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or  (  =  (Logz)/(Loge),  any  values  of  the  logarithms  being  taken.  Thus 


C  =  Loge  z 


log  \z\  +  (amz  +  2rmr)i 
1  +  2mri 


so  that  C  is  a  doubly  infinitely  many- valued  function  of  z.  And  generally,  ac¬ 
cording  to  this  definition,  Log az  =  (Logz)/(Loga). 

16.  Loge  1  =  2mm /(I  +  2niri),  Loge(— 1)  =  (2m  +  l)7ri/(l  +  2 mri),  where 
m  and  n  are  any  integers. 


227.  The  exponential  values  of  the  sine  and  cosine.  From  the 
formula 

exp  (£  +  irj)  —  exp  £ (cos  77  +  7  sin  rj) , 

we  can  deduce  a  number  of  extremely  important  subsidiary  formulae.  Tak¬ 
ing  £  =  0,  we  obtain  exp(777)  =  cos  77  +  7  sin  77;  and,  changing  the  sign  of  77, 
exp(— 777)  =  cos  77  —  i  sin  77.  Hence 

cos  77  =  \  {exp  (777)  +  exp  (—777)}, 

sin  77  =  —  |i{exp(777)  —  exp(— 777)}. 

We  can  of  course  deduce  expressions  for  any  of  the  trigonometrical  ratios 
of  77  in  terms  of  exp (777). 

228.  Definition  of  sin  £  and  cos£  for  all  values  of  (.  We  saw  in 

the  last  section  that,  when  (  is  real, 

cos  £  =  |  {exp(iC)  +exp(-7C)},  (la) 

sinC  =  -|7{exp(7C)  -exp(-iC)}.  (lb) 

The  left-hand  sides  of  these  equations  are  defined,  by  the  ordinary 
geometrical  definitions  adopted  in  elementary  Trigonometry,  only  for  real 
values  of  (.  The  right-hand  sides  have,  on  the  other  hand,  been  defined  for 
all  values  of  £,  real  or  complex.  We  are  therefore  naturally  led  to  adopt  the 
formulae  (1)  as  the  definitions  of  cos£  and  sin  £  for  all  values  of  (.  These 
definitions  agree,  in  virtue  of  the  results  of  §  227,  with  the  elementary 
definitions  for  real  values  of  £. 
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Having  defined  cos  £  and  sin  d,  we  define  the  other  trigonometrical  ratios 
by  the  equations 


.  sin  (  cos  (  1  1 

tan  (  — - cot  d  =  —  — ,  sec  (  = - cosec  C  =  t-t- 

cos  Q  sin  Q  cos  Q  sin  Q 


(2) 


It  is  evident  that  cosd  and  seed  are  even  functions  of  (,  and  sin  d,  tan  d, 
cotd,  and  cosec  (  odd  functions.  Also,  if  exp(id)  =  t,  we  have 


cosC  =  \{t  +  (1/t)},  sinC  =  -\i{t  -  (1/t)}, 
cos2  C  +  sin2  C  =  \[{t  +  (1  A)}2  ~{t~  (1  A)}2]  =  1.  (3) 


We  can  moreover  express  the  trigonometrical  functions  of  (+('  in  terms 
of  those  of  C  and  ('  by  precisely  the  same  formulae  as  those  which  hold  in 
elementary  trigonometry.  For  if  exp(yd)  =  t ,  exp( i(')  =  t/ ,  we  have 


cos(C  +  O 


cos  d  cos  ('  —  sin  (  sin 


(4) 


and  similarly  we  can  prove  that 

sin(d  +  C)  —  sin  C  cos  C  +  cos  C  sin  C-  (5) 


In  particular 


cos(d  +  ^vr)  =  —  sind,  sin(d  +  =  cos(.  (6) 

All  the  ordinary  formulae  of  elementary  Trigonometry  are  algebraical 
corollaries  of  the  equations  (2) — (6) ;  and  so  all  such  relations  hold  also  for 
the  generalised  trigonometrical  functions  defined  in  this  section. 
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229.  The  generalised  hyperbolic  functions.  In  Ex.  lxxxvii.  19, 
we  defined  cosh  £  and  sinh  £,  for  real  values  of  £,  by  the  equations 

cosh£  =  l{exp£  +  exp(— £)},  sinh £  =  |{exp £  -  exp(— £)}.  (1) 

We  can  now  extend  this  definition  to  complex  values  of  the  variable;  i.e.  we 
can  agree  that  the  equations  (1)  are  to  define  cosh£  and  sinh£  for  all  values  of  £ 
real  or  complex.  The  reader  will  easily  verify  the  following  relations: 

cosi£  =  cosh£,  sin  iQ  =  i  sinh  £,  cosh  i£  =  cos  £,  sinh  i£  =  i  sin  £. 

We  have  seen  that  any  elementary  trigonometrical  formula,  such  as  the  for¬ 
mula  cos  2£  =  cos2  £  —  sin2  £,  remains  true  when  £  is  allowed  to  assume  complex 
values.  It  remains  true  therefore  if  we  write  cos  z£  for  cos£,  sin  z£  for  sin£  and 
cos  2 i£  for  cos  2£;  or,  in  other  words,  if  we  write  cosh  £  for  cos  £,  i  sinh  (  for  sin  (, 
and  cosh  2(  for  cos  2(.  Hence 

cosh  2(  =  cosh2  (  +  sinh2  (. 

The  same  process  of  transformation  may  be  applied  to  any  trigonometrical 
identity.  It  is  of  course  this  fact  which  explains  the  correspondence  noted  in 
Ex.  lxxxvii.  21  between  the  formulae  for  the  hyperbolic  and  those  for  the  or¬ 
dinary  trigonometrical  functions. 

230.  Formulae  for  cos(£  +  ir /),  sin(£  +  it)),  etc.  It  follows  from  the 
addition  formulae  that 

cos(£  +  it))  =  cos  £  cos  ir)  —  sin  £  sin  irj  =  cos  £  cosh  r)  —  i  sin  £  sinh  r), 
sin(£  +  it))  =  sin  £  cos  ir)  +  cos  £  sin  it)  =  sin  £  cosh  r)  +  i  cos  £  sinh  i). 

These  formulae  are  true  for  all  values  of  £  and  i).  The  interesting  case  is  that  in 
which  £  and  i)  are  real.  They  then  give  expressions  for  the  real  and  imaginary 
parts  of  the  cosine  and  sine  of  a  complex  number. 

Examples  XCV.  1.  Determine  the  values  of  £  for  which  cos  £  and  sin  £ 
are  (i)  real  (ii)  purely  imaginary.  [For  example  cos  £  is  real  when  rj  =  0  or  when 
£  is  any  multiple  of  vr.] 
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2. 


|  cos(£  +  iq)  |  =  \J cos2  £  +  sinh2  q  =  \J \  (cosh  2 q  +  cos  2£ ) , 

|  sin(£  +  irj)  |  =  \J\ sin2  £  +  sinh2  q  =  \J \  (cosh  2 r]  —  cos  2£ ) . 

[Use  ( e.g .)  the  equation  |  cos(£  +  irj)\  =  y/cos(^  +  irj)  cos(£  —  iq).] 

sin  2£  +  i  sinh  2q  sin  2 £  —  i  sinh  2q 

3.  tan(4  +  irj)  =  — — — - cot(£  +  irj)  = 


[For  example 

tan(£  +  ir])  = 


cosh  2q  +  cos  2£  cosh  2i]  —  cos  2£ 

sin(£  +  ir])  cos(£  —  ir j)  sin  2£  +  sin  2 ir] 


cos(£  +  ir])  cos(£  —  irj)  cos  2£  +  cos  2 ir]  ’ 


which  leads  at  once  to  the  result  given.] 

4. 


sec(£  +  irj)  = 
cosec  +  ir])  = 


cos  £  cosh  r]  +  i  sin  £  sinh  q 
\  (cosh  2  r]  +  cos  2£)  ’ 

sin  £  cosh  q  —  i  cos  £  sinh  r/ 


l 


(cosh  2r]  —  cos  2£) 


5.  If  |  cos(£  +  iq) |  =  1  then  sin2  j  =  sinh2  q,  and  if  |  sin(£  +  iq)\  =  1  then 
cos2  £  =  sinh2  q. 

6.  If  |  cos(£  +  iq)\  =  1,  then 


sin  {am  cos(£  +  iq)}  =  ±  sin2  £  =  ±  sinh2  q. 


7.  Prove  that  Logcos(£  +  iq)  =  A  +  iB ,  where 

A  =  1  log{  \  (cosh  2q  +  cos  2£) } 

and  B  is  any  angle  such  that 

cos  B  sin  B  1 

cos  £  cosh  q  ~  sin  £  sinh  q  (cosh  2??  +  cos  2£) 

Find  a  similar  formula  for  Logsin(£  +  iq). 
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8.  Solution  of  the  equation  cost)  =  a,  where  a  is  real.  Putting 
(  =  £  +  ir),  and  equating  real  and  imaginary  parts,  we  obtain 

cos  £  cosh  i]  =  a,  sin  £  sinh  r]  =  0. 

Hence  either  rj  =  0  or  £  is  a  multiple  of  n.  If  (i)  rj  =  0  then  cos£  =  a,  which  is 
impossible  unless  —  1  ^  a  ^  1.  This  hypothesis  leads  to  the  solution 

(  =  2kir  ±  arc  cos  a, 

where  arc  cos  a  lies  between  0  and  ^  7r.  If  (ii)  £  =  rnir  then  cosh?]  =  (— l)ma,  so 
that  either  a  ^  1  and  m  is  even,  ora  ^  —1  and  m  is  odd.  If  a  =  ±1  then  ij  =  0, 
and  we  are  led  back  to  our  first  case.  If  |o|  >  1  then  cosh  ?]  =  |o|,  and  we  are  led 
to  the  solutions 


(  =  2/c7r±ilog{  a  +  \J a2  —  1}  (a  >  1), 

(  =(2  k  +  1)7 r  ±  i  log{— a  +  a2  —  1}  (a  <  — 1). 

For  example,  the  general  solution  of  cosC  =  —  |  is  £  =  (2 k  +  l)7r  ±  ilog3. 

9.  Solve  sin()  =  a,  where  a  is  real. 

10.  Solution  of  cosC  =  ol  +  i/3,  where  (3^0.  We  may  suppose  (3  >  0, 
since  the  results  when  (3  <  0  may  be  deduced  by  merely  changing  the  sign  of  i. 
In  this  case 

cos  £  cosh  r]  =  a,  sin  £  sinh  ij  =  —f3,  (1) 

and 

(a/  cosh  ?])2  +  (f3/  sinh  ?])2  =  1. 

If  we  put  cosh2  i]  =  x  we  find  that 

x2  —  (1  +  a2  +  (32)x  +  a2  =  0 
or  x  =  ( A\  ±  A2)2 ,  where 

A\  =  \  \/{a  +  l)2  +  /32,  A2  =  \  y/{a-  l)2  +  /32. 

Suppose  a  >  0.  Then  A\  >  A2  >  0  and  cosh?]  =  A\  ±  A2.  Also 


cos  £  =  a/ (cosh  rj)  =  A\  A2 
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and  since  cosh  r/  >  cos£  we  must  take 

cosh??  =  A\  +  A-2,  cos£  =  A\  —  A 2. 

The  general  solutions  of  these  equations  are 

£  =  2kn  ±  arc  cos  M,  ??  =  ±  log{L  +  \/  L2  —  1},  (2) 

where  L  =  A\  +  A2,  M  =  A\  —  A2,  and  arc  cos  M  lies  between  0  and  \n. 

The  values  of  77  and  £  thus  found  above  include,  however,  the  solutions  of 
the  equations 

cos  £  cosh  77  =  a,  sin  £  sinh  r]  =  f3,  (3) 

as  well  as  those  of  the  equations  (1),  since  we  have  only  used  the  second  of  the 
latter  equations  after  squaring  it.  To  distinguish  the  two  sets  of  solutions  we 
observe  that  the  sign  of  sin£  is  the  same  as  the  ambiguous  sign  in  the  first  of 
the  equations  (2),  and  the  sign  of  sinh  77  is  the  same  as  the  ambiguous  sign  in 
the  second.  Since  f3  >  0,  these  two  signs  must  be  different.  Hence  the  general 
solution  required  is 

£  =  2kn  ±  [arc  cos  M  —  i  log{L  +  \J  L2  —  1}]. 


11.  Work  out  the  cases  in  which  a  <  0  and  a  =  0  in  the  same  way. 

12.  If  f3  =  0  then  L  =  \\a  +  1|  +  || a  —  1|  and  M  =  || a  +  1|  —  || a  —  1|. 
Verify  that  the  results  thus  obtained  agree  with  those  of  Ex.  8. 

13.  Show  that  if  a  and  /3  are  positive  then  the  general  solution  of 
sin  £  =  a  +  i/3  is 

£  =  kn  +  (— l)fc[arcsin  M  +  zlog{L  +  \J  L2  —  1}], 


where  arc  sin  M  lies  between  0  and  ^7r.  Obtain  the  solution  in  the  other  possible 
cases. 

14.  Solve  tan£  =  a,  where  a  is  real.  [All  the  roots  are  real.] 

15.  Show  that  the  general  solution  of  tan£  =  a  +  i/3,  where  /3  /  0,  is 


c 


kn  +  \6  +  \i\og 


a2  +  (1  +  (3)2 
a2  +  (1  —  f3)2 


where  9  is  the  numerically  least  angle  such  that 

cos#  :  sin#  :  1  ::  1  —  a2  —  j32  :  2a  :  \/ (1  —  a2  —  j32)2  +  4 a2. 
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16.  If  z  =  £  exp(l7ri),  where  £  is  real,  and  c  is  also  real,  then  the  modulus 
of  cos  27 tz  —  cos  2irc  is 

\/[^{l  +  cos  47rc  +  cos(27t£\/2)  +  cosh(27r£\/2) 

—  4  cos  27tccos(7t£v/2)  cosh(7r£v/2)}]. 


17.  Prove  that 


|  exp  exp(£  +  irf)\  =  exp(exp£cos7/), 

R{coscos(£  +  z?y)}  =  cos(cos£  cosh  77)  cosh(sin£  sinh  r/) , 

I{sinsin(£  +  irj)}  =  cos(sin£coshr/)  sinh(cos£sinh?7). 

18.  Prove  that  |  exp  £|  tends  to  oo  if  £  moves  away  towards  infinity  along  any 
straight  line  through  the  origin  making  an  angle  less  than  ^7r  with  OX,  and  to  0 
if  £  moves  away  along  a  similar  line  making  an  angle  greater  than  ^7r  with  OX. 

19.  Prove  that  |  cos£|  and  |  sin£|  tend  to  oo  if  £  moves  away  towards  infinity 
along  any  straight  line  through  the  origin  other  than  either  half  of  the  real  axis. 

20.  Prove  that  tan  £  tends  to  —i  or  to  i  if  £  moves  away  to  infinity  along  the 
straight  line  of  Ex.  19,  to  —  i  if  the  line  lies  above  the  real  axis  and  to  i  if  it  lies 
below. 


231.  The  connection  between  the  logarithmic  and  the  inverse 
trigonometrical  functions.  We  found  in  Ch.  VI  that  the  integral  of  a 
rational  or  algebraical  function  a,  /3, . . .),  where  a,  /3,  ...  are  constants, 
often  assumes  different  forms  according  to  the  values  of  a,  j3,  . . . ;  sometimes  it 
can  be  expressed  by  means  of  logarithms,  and  sometimes  by  means  of  inverse 
trigonometrical  functions.  Thus,  for  example, 


/ 


dx 

x2  +  a 


1  x 

—j=  arc  tan  —= 

yJOi  y/a 


(1) 


if  a  >  0,  but 


dx 

2  +  a 


(2) 


if  a  <  0.  These  facts  suggest  the  existence  of  some  functional  connection  be¬ 
tween  the  logarithmic  and  the  inverse  circular  functions.  That  there  is  such  a 
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connection  may  also  be  inferred  from  the  facts  that  we  have  expressed  the  cir¬ 
cular  functions  of  £  in  terms  of  exp  and  that  the  logarithm  is  the  inverse  of 
the  exponential  function. 

Let  us  consider  more  particularly  the  equation 


dx  1  , 

=  2Slog 


x 2  —  a 2 


x  —  a 


x  +  a 


which  holds  when  a  is  real  and  (x  —  a)/(x  +  a)  is  positive.  If  we  could  write  ia 
instead  of  a  in  this  equation,  we  should  be  led  to  the  formula 

/x\  1  ,  f x  —  ia\  .  , 

arc  tan  (  -  )  =  —  log  — —  +  C,  (3) 

\a/  2 1  \x  +  ia  J 

where  C  is  a  constant,  and  the  question  is  suggested  whether,  now  that  we  have 
defined  the  logarithm  of  a  complex  number,  this  equation  will  not  be  found  to 
be  actually  true. 

Now  (§  221) 

Log(x  ±  ia)  =  \  log(x2  +  a2)  ±  i(</>  +  2kir), 

where  k  is  an  integer  and  cj)  is  the  numerically  least  angle  such  that 
cos  (f>  =  xj \/ x2  +  a2  and  sin  cf>  =  a/ V x2  +  a2.  Thus 


1 

2 i 


Log 


x  —  la 
x  +  ia 


—cj)  —  lir, 


where  l  is  an  integer,  and  this  does  in  fact  differ  by  a  constant  from  any  value 
of  arctan(x/a). 

The  standard  formula  connecting  the  logarithmic  and  inverse  circular  func¬ 
tions  is 


arc  tan  x  =  —  Log 
2 1 


1  +  ix 


(4) 


1  —  ix 

where  x  is  real.  It  is  most  easily  verified  by  putting  x  =  tany,  when  the  right' 
hand  side  reduces  to 


1 

2 i 


Log 


cos  y  +  i  sin  y 
cos  y  —  i  sin  y 


—  Log(exp  2  iy)  =  y  +  fo r, 

2i 


where  k  is  any  integer,  so  that  the  equation  (4)  is  ‘completely’  true  (Ex.  xcm.  3). 
The  reader  should  also  verify  the  formulae 


arc  cos  x  =  —i  Log{x  ±  i  \J  1  —  x2}, 


arc  sin  x  =  —  iLog{ix  ±  —  x2}, 


(5) 


where  —  1  fs  x  ^  1:  each  of  these  formulae  also  is  ‘completely’  true. 
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Example.  Solving  the  equation 

cos  u  =  x=\{y  +  (1  /y)}, 

where  y  =  exp(ut),  with  respect  to  y,  we  obtain  y  =  x  ±  iV  1  —  x2.  Thus: 
u  =  —  iLogy  =  —  iLog{x  ±  i\J  1  —  x2}, 

which  is  equivalent  to  the  first  of  the  equations  (5).  Obtain  the  remaining 
equations  (4)  and  (5)  by  similar  reasoning. 


232.  The  power  series  for  exp  z*  We  saw  in  §  212  that  when  z  is 
real 

~2 

expz  =  l  +  z  +  —  + -  (1) 

Moreover  we  saw  in  §  191  that  the  series  on  the  right-hand  side  remains 
convergent  (indeed  absolutely  convergent)  when  z  is  complex.  It  is  natu¬ 
rally  suggested  that  the  equation  (1)  also  remains  true,  and  we  shall  now 
prove  that  this  is  the  case. 

Let  the  sum  of  the  series  (1)  be  denoted  by  F(z).  The  series  being  ab¬ 
solutely  convergent,  it  follows  by  direct  multiplication  (as  in  Ex.  lxxxi.  7) 
that  F(z)  satisfies  the  functional  equation 

F{z)F{h)  =  F(z  +  h).  (2) 

Now  let  0  =  iy,  where  y  is  real,  and  F(z)  =  f(y).  Then 

f(y)m  =  ny  +  k); 


and  so 


But 


f(y  +  k)-f(y) 


k 


=  f(y ) 


m  - 1 

k 


m  - 1 

k 


...  ik  ( ik )2 

=  z<!  +  2!  +  ^r  +  --- 


*It  will  be  convenient  now  to  use  z  instead  of  (  as  the  argument  of  the  exponential 
function. 
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and  so,  if  \k\  <  1, 


m  - 1 

k 


< 


<  (e  —  2)|/c|. 


Hence  {f(k)  —  l}//c  — >  i  as  k  — >  0,  and  so 


f(y) 


Um  f(y  +  k)-f(y) 
k->  o  k 


if{y)- 


(3) 


Now 

f(y)  =  F(w)  =  1  +  (iy)  +  pp-  +  •  ■  •  =  <t>{y)  +  #(y), 


where  <f)(y)  is  an  even  and  ^(y)  an  odd  function  of  y,  and  so 

\f(y)\  =  ViHv)}2  +  ftK y )}2 

=  V {<P(y)  +  #(y)H0(y)  -  #(y)} 

=  y/F(iy)F(-iy)  =  y/¥[ 0)  =  1; 


and  therefore 


/(y)  =  cos  y  +  i  sin  F, 


where  F  is  a  function  of  y  such  that  —7 r  <  F  ^  7r.  Since  /(y)  has  a 
differential  coefficient,  its  real  and  imaginary  parts  cosF  and  sinF  have 
differential  coefficients,  and  are  a  fortiori  continuous  functions  of  y.  Hence 
Y  is  a  continuous  function  of  y.  Suppose  that  Y  changes  to  F  +  K  when 
y  changes  to  y  +  k.  Then  K  tends  to  zero  with  k,  and 


I<  _  |  cos(F  +  K)  -  cos  F 1  /  f  cos(F  +  iC)  -  cos  F  1 


Of  the  two  quotients  on  the  right-hand  side  the  first  tends  to  a  limit  when 
k  — y  0,  since  cosF  has  a  differential  coefficient  with  respect  to  y,  and  the 
second  tends  to  the  limit  —  sinF.  Hence  K/k  tends  to  a  limit,  so  that 
F  has  a  differential  coefficient  with  respect  to  y. 
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Further 


clY 

f'(y )  =  (—  sin  Y  +  i  cos  Y)- — . 

dy 


But  we  have  seen  already  that 

f\y)  =  if(y)  =  —  sin  Y  +  i  cos  Y. 


Hence 


dY 

dy 


_  =  1,  Y  =  y  +  C , 


where  C  is  a  constant,  and 

f(y)  =  cos  (y  +  C)  +  ism(y  +  C). 

But  /( 0)  =  1  when  y  —  0,  so  that  C  is  a  multiple  of  2i r,  and 
f(y)  =  cos  y  +  ismy.  Tims  F(iy)  =  cos  y  +  i  sin  y  for  all  real  values  of  y. 
And,  if  x  also  is  real,  we  have 

F(x  +  iy )  =  F(x)F{iy)  =  exp  x  {cosy  +  ismy )  =  exp  (a;  +  iy), 


or 

exp  Z  —  1  +  Z+  —  +  ..., 

for  all  values  of  z. 


233.  The  power  series  for  cosz  and  sinz.  From  the  result  of  the 
last  section  and  the  equations  (1)  of  §  228  it  follows  at  once  that 


cosz  = 


2!  +  4! 


^3  ^5 

“=I-j+r 


for  all  values  of  0.  These  results  were  proved  for  real  values  of  z  in  Ex.  lvi.  1. 


[X :  233] 


EXPONENTIAL,  AND  CIRCULAR  FUNCTIONS 


523 


Examples  XCVI.  1.  Calculate  cos  i  and  sin  i  to  two  places  of  decimals 
by  means  of  the  power  series  for  cos  z  and  sin  z. 

2.  Prove  that  |  cosz|  5^  cosh|2;|  and  |  sinz|  5=  sinh|z|. 

3.  Prove  that  if  \z\  <  1  then  |  cos^l  <  2  and  |  sin z\  <  ||z|. 

4.  Since  sin  2z  =  2  sin  z  cos  z  we  have 


(2*)- 


(2 z)3  (2 zf  „  {  zr  \  ( _  z 


3! 


+ 


5!  2  V  3!  +"'  J  l1  2!  +"'  1  ' 


Prove  by  multiplying  the  two  series  on  the  right-hand  side  (§  195)  and  equating 
coefficients  (§  194)  that 

Verify  the  result  by  means  of  the  binomial  theorem.  Derive  similar  identities 
from  the  equations 

0.0  o  o 

cos  z  +  sin  z  =  1,  cos  2z  =  2  cos^  z  —  1  =  1  —  2  sin  z. 


5.  Show  that 


exp{(l +  *)*}  =  £;  23"  exp(i 


Z " 

I  - 

n\ 


6.  Expand  cos  z  cosh  z  in  powers  of  2.  [We  have 
coszcosh.2  +  isinzsinhz  =  cos{(l  —  i)z}  =  ^[exp{(l  +  i)z}  +  exp{— (1  +  i)z}} 

OO 

=  ^2i”{l  +  (-l)”}exp(ii 


*  n 


z 

n\ 


and  similarly 

OO 

cos  z  cosh  z  —  i  sinz  sinhz  =  cos(l  +  i)z  =  \  2an{l  +  (— l)n}  exp(— \niri) 

o 

Hence 

22z4  2 V 


n\ 


cos  z  cosh  2  =  |  X  22?1{1  +  (—I)"-}  cos  =  1  —  — b 


4! 


8! 
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7.  Expand  sinzsinhz,  coszsinhz,  and  sin  z  cosh  z  in  powers  of  z. 

8.  Expand  sin2  z  and  sin3  z  in  powers  of  z.  [Use  the  formulae 

sin2  z  =  |(1  —  cos  2 z) ,  sin3  z  =  \ (3  sin  z  —  sin  3 z) ,  ... . 

It  is  clear  that  the  same  method  may  be  used  to  expand  cosn  z  and  sin71  z,  where 
n  is  any  integer.] 

9.  Sum  the  series 

^  cosz  cos2z  cos3z  „  sinz  sin2z  sin3z 

C  =  l  +  — — +  — —  +  — —  +  ...,  S  =  — - — I - — - 1 - — - 1-  — 


1! 


2! 


3! 


2! 


3! 


[Here 


„  ,  exp(iz)  exp(2zz)  r  , . 

C  +  iS  =  1  H - — — -  H - — — -  H - =  expjexp(zz)} 

=  exp(cos  z){cos(sin  z)  +  i  sin(sin  z)}, 


and  similarly 


C  —  iS  =  exp{exp(— iz)}  =  exp(cosz){cos(sinz)  —  zsin(sinz)}. 


Hence 


C  =  exp(cosz)  cos(sinz),  S  =  exp(cosz)  sin(sinz).] 


10.  Sum 


1  + 


a  cos  z  a 2  cos  2z 


1! 


+ 


2! 


+  . . . , 


a  sin  z  a2  sin  2z 


1! 


+ 


2! 


+ 


11.  Sum 


cos  2z  cos  4z 
1 - ^  + 


cos  z  cos  3z 


2!  '  4!  1!  3! 

and  the  corresponding  series  involving  sines. 

12.  Show  that 
cos  4z  cos  8z 


+ 


1  + 


4! 


H - — - {-■■■  =  i{cos(cos  z)  cosh(sin  z)  +  cos(sin z)  cosh(cos  z)}. 

8! 


13.  Show  that  the  expansions  of  cos(x  +  h)  and  sin(x  +  h)  in  powers  of  h 
(Ex.  LVi.  1)  are  valid  for  all  values  of  x  and  h,  real  or  complex. 
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234.  The  logarithmic  series.  We  found  in  §  213  that 


log(l  +  z)  =  z  -  ±z2  +  ±z3  -  .  .  . 


(1) 


when  z  is  real  and  numerically  less  than  unity.  The  series  on  the  right-hand 
side  is  convergent,  indeed  absolutely  convergent,  when  z  has  any  complex 


value  whose  modulus  is  less  than  unity.  It  is  naturally  suggested  that  the 


equation  (1)  remains  true  for  such  complex  values  of  z.  That  this  is  true 
may  be  proved  by  a  modification  of  the  argument  of  §  213.  We  shall  in  fact 
prove  rather  more  than  this,  viz.  that  (1)  is  true  for  all  values  of  z  such 
that  \z\  ^  1,  with  the  exception  of  the  value  —1. 

It  will  be  remembered  that  log(l  +  z)  is  the  principal  value  of  Log(l  +  z), 
and  that 


where  C  is  the  straight  line  joining  the  points  1  and  l  +  z  in  the  plane  of  the 
complex  variable  u.  We  may  suppose  that  z  is  not  real,  as  the  formula  (1) 
has  been  proved  already  for  real  values  of  z. 


If  we  put 


z  =  r  (cos  6  +  i  sin  6)  =  (r, 


so  that  |r|  ^  1,  and 


u  =  l  +  (t, 


then  u  will  describe  C  as  t  increases  from  0  to  r.  And 


f  du  r  C  dt 

Jc  u  Jo  i  +  C* 


L 


f 

Jo 

/ 


•r 


i 


C-C2*  +  C3*2-.--  +  (-i) 


(Crf  (Crf 
s  2  3 


- 1-  (-l)m_1LA^ - b  R, 

m 


...  +  (_ 


(2) 


where 


(3) 
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It  follows  from  (1)  of  §  164  that 


\R  I  < 

1  Lm.  — 


tmdt 

|i  +  C^l 


(4) 


Now  |1  +  (t\  or  \u\  is  never  less  than  xzr,  the  perpendicular  from  O  on  to 
the  line  C*  Hence 


„m+ 1 


\RJ  <  -  /  tmdt  = 


< 


zu 


(m  +  l)w  (m  +  l)ru’ 


and  so  Rm  — >  0  as  m  — >  oo.  It  follows  from  (2)  that 


log(l  +  z)  =  z  -  h2  +  h3 


(5) 


We  have  of  course  shown  in  the  course  of  our  proof  that  the  series  is 
convergent:  this  however  has  been  proved  already  (Ex.  LXXX.  4).  The  series 
is  in  fact  absolutely  convergent  when  |z|  <  1  and  conditionally  convergent 
when  1 2 1  =  1. 

Changing  z  into  —  z  we  obtain 


log 


=  -log(l  -  z) 


£+T22+TL3  +  .... 


(6) 


235.  Now 

log(l  +  z)  =  log{(l  +  r  cos  6)  +  ir  sin  6} 

1  2  (  r  sin  6  \ 

=  o  log(  1  +  2r  cos  0  +  r" )  +  i  arc  tan  - -  . 

2  v  \  i  +  r  cos  6  J 

That  value  of  the  inverse  tangent  must  be  taken  which  lies  between 
and  \tz.  For,  since  1  -j-z  is  the  vector  represented  by  the  line  from  —1  to  z, 
the  principal  value  of  am(l  +  z)  always  lies  between  these  limits  when  z  lies 
within  the  circle  |^|  =  l.t 

*  Since  z  is  not  real,  C  cannot  pass  through  O  when  produced.  The  reader  is  rec¬ 
ommended  to  draw  a  figure  to  illustrate  the  argument. 

^See  the  preceding  footnote. 
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Since  zm  =  rm(cosm9  +  ismm9),  we  obtain,  on  equating  the  real  and 
imaginary  parts  in  equation  (5)  of  §  234, 

r  cos  6  —  \r2  cos  29  +  |r3  cos  39  —  , 

r  sin  9  —  ^r2  sin  29  +  |r3  sin  39  —  ... . 

These  equations  hold  when  0  ^  r  ^  1,  and  for  all  values  of  9,  except  that, 
when  r  —  1,  9  must  not  be  equal  to  an  odd  multiple  of  tt.  It  is  easy  to  see 
that  they  also  hold  when  —  1  ^  r  ^  0,  except  that,  when  r  =  —  1,  9  must 
not  be  equal  to  an  even  multiple  of  tt. 

A  particularly  interesting  case  is  that  in  which  r  —  1.  In  this  case  we 
have 

f  sin  0  \ 

log(l  +  z)  —  log(l  +  Cis  9)  =  |  log(2  +  2  cos  9)  +  i  arc  tan  - - 

\  1  +  cos  9  ) 

=  \  log(4cos2  \ 9 )  +  \%9, 


\  log(l  +  2 r  cos  9  +  r2)  = 

f  r  sin  9  \ 

arc  tan  - -  = 

\  1  +  r  cos  9  J 


if  —  tt  <  9  <  n,  and  so 

cos  9  —  \  cos  2 9  +  |  cos  36*  —  . . .  =  |  log(4cos2  |0), 

sin  9  —  \  sin  29  +  |  sin  39  —  ...  =  \9. 

The  sums  of  the  series,  for  other  values  of  9,  are  easily  found  from  the 
consideration  that  they  are  periodic  functions  of  9  with  the  period  27t. 
Thus  the  sum  of  the  cosine  series  is  \  log(4cos2  \ 9 )  for  all  values  of  9  save 
odd  multiples  of  tt  (for  which  values  the  series  is  divergent),  while  the  sum 

of  the  sine  series  is  ^(9  —  2kir)  if  (2k  —  l)7r  <  9  <  (2k  +  l)7r,  and  zero  if  9  is 

an  odd  multiple  of  tt.  The  graph  of  the  function  represented  by  the  sine 
series  is  shown  in  Fig.  58.  The  function  is  discontinuous  for  9  =  (2k  +  l)7r. 


If  we  write  iz  and  —  iz  for  z  in  (5),  and  subtract,  we  obtain 

1  +  iz' 


h]og 


1  —  iz 


=  z-±z3  +  lz5-.... 
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Fig.  58. 


If  z  is  real  and  numerically  less  than  unity,  we  are  led,  by  the  results  of  §  231, 
to  the  formula 

arc  tan  z  =  z  —  ^ z 3  +  —  . . . , 

already  proved  in  a  different  manner  in  §  214. 


Examples  XCVII.  1.  Prove  that,  in  any  triangle  in  which  a  >  b, 

b  b2 

log  c  =  log  a - cos  C - ~  cos  2C  —  . . . . 

a  2  az 

[Use  the  formula  logc  =  \  log(a2  +  b2  —  2abcosC).] 

2.  Prove  that  if  — 1  <  r  <  1  and  —  <  6  <  \ts  then 


r  sin  26  —  ^r2  sin  46  +  ^ r 3  sin  66  —  ■■■  =  6  —  arc  tan 


1  —  r 
1  +  r 


tan  6 


the  inverse  tangent  lying  between  —  jvr  and  \tt.  Determine  the  sum  of  the  series 
for  all  other  values  of  6. 

3.  Prove,  by  considering  the  expansions  of  log(l  +  iz)  and  log(l  —  iz)  in 
powers  of  z,  that  if  —  1  <  r  <  1  then 


r  sin  6  +  ^r2  cos  26  —  ir3  sin 36  —  |r4  cos  46  +  . . .  =  ^  log(l  +  2 r  sin#  +  r2), 


r  cos  6  +  ^r2  sin  26  —  ^r3  cos  36  —  ^-r4  sin  46  +  . . .  =  arc  tan 


r  cos  6 
1  —  r  sin  ( 
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r  sin  0  —  ^r3  sin  30  + 

r  cos  0  —  ^r3  cos  30  + 

the  inverse  tangents  lying  between 
4.  Prove  that 

cos  0  cos  0  —  j  cos  20  cos2  0  +  g  cos  30  cos3  0  —  . . .  =  \  log(l  +  3  cos2  0), 

sin 0  sin  0  —  5  sin  20  sin2  0+5  sin 30 sin3  0  —  . . .  =  arc cot(l  +  cot  0  +  cot2  0), 

the  inverse  cotangent  lying  between  —  \tt  and  r;  and  find  similar  expressions 
for  the  sums  of  the  series 

cos  0  sin  0  —  |  cos  20  sin2  6  +  ... ,  sin  0  cos  9  —  \  sin  20  cos2  9  +  ... . 


■■■  =  z  log 


1  +  2r  sin  0  +  r2 
1  —  2r  sin  0  +  r2 
2r  cos  0 


=  ^  arc  tan  ,  „ 

z  '1  —  7*^ 


-57r  and  \it. 


236.  Some  applications  of  the  logarithmic  series.  The  expo¬ 
nential  limit.  Let  z  be  any  complex  number,  and  h  a  real  number  small 
enough  to  ensure  that  \hz\  <  1.  Then 

log(l  +  hz)  =  hz  —  \{hz )2  +  l(hz)3  —  . . . , 


and  so 


where 


log(l  +  hz) 
h 


=  z  +  cj)(h,  z), 


4>(h,  z)  =  —\hz  +  \h  z6  —  \lrz  + 

| <f>(h,  z) |  <  \hz2\(l  +  \hz\  +  \h2z2\  +  ...)  = 


so  that  4>(h,  z)  — >  0  as  h  — >  0.  It  follows  that 


log(l  +  hz) 

inn - - - =  z. 

h — >0  h 


|  hz2 


1  —  \hz\ 


(1) 
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If  in  particular  we  suppose  h  =  1/n,  where  n  is  a  positive  integer,  we 
obtain 


lim  n  log  ( 1  H —  )  —  z, 


and  so 


lim  (l  H — )  =  lim  exp  {nlog  (l  H — )  }  =  expz. 

n—toc  V  77,/  n— »oo  l  V  TlJ  J 


(2) 


This  is  a  generalisation  of  the  result  proved  in  §  208  for  real  values  of  z. 

From  (1)  we  can  deduce  some  other  results  which  we  shall  require  in 
the  next  section.  If  f  and  h  are  real,  and  h  is  sufficiently  small,  we  have 

log(l  T  tz  +  hz)  —  log(l  +  tz)  1.  /  hz 

-  log  1  + 


h  h 

which  tends  to  the  limit  z/(  1  +  tz)  as  h  — >  0.  Hence 

d  ,,  _  2; 


1  +  tz 


(3) 


We  shall  also  require  a  formula  for  the  differentiation  of  (1 +tz)m,  where 
m  is  any  number  real  or  complex,  with  respect  to  t.  We  observe  first  that, 
if  0(f)  =  0(f)  +  ix( t)  is  a  complex  function  of  f,  whose  real  and  imaginary 
parts  0(f)  and  y(f)  possess  derivatives,  then 

(exp  0)  =  ^  { (cos  x  +  *  sin  x)  exp  0 } 

=  {(cos  x  +  i  sin  a)07  +  (—  si11  X  +  * cos  x)x'}  exP  0 
=  (0;  +  ix')  (cos  x  +  i  sin  x)  exp  0 
=  (0'  +  w')  exp(0  +  ix)  =  0'  exp  0, 


so  that  the  rule  for  differentiating  exp  0  is  the  same  as  when  0  is  real.  This 
being  so  we  have 


d 

dt 


(1  +  tz)m 


—  exp{m  log(l  +  tz)} 

CIL 

TflZ 

exp{mlog(l  +  tz)} 


1  +  tz 
mz{  1  +  tz)m -1 


Here  both  (1  +  tz)m  and  (1  +  tz)m  1  have  their  principal  values. 


(4) 
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237.  The  general  form  of  the  Binomial  Theorem.  We  have 
proved  already  (§  215)  that  the  sum  of  the  series 


is  (1  +  z)m  =  exp{m  log(l  +  z)},  for  all  real  values  of  m  and  all  real  values 
of  z  between  —1  and  1.  If  an  is  the  coefficient  of  zn  then 

®n+ 1 
Cln 

whether  m  is  real  or  complex.  Hence  (Ex.  LXXX.  3)  the  series  is  always 
convergent  if  the  modulus  of  z  is  less  than  unity,  and  we  shall  now  prove 
that  its  sum  is  still  exp{mlog(l  +  z)},  i.e.  the  principal  value  of  (1  +  z)m. 
It  follows  from  §  236  that  if  t  is  real  then 

~~  (IT  tz)m  =  mz(  1  +  tz)m -1, 

z  and  m  having  any  real  or  complex  values  and  each  side  having  its  principal 
value.  Hence,  if  4>(t)  =  (1  +  tz)m,  we  have 

</,H(t)  =  m(m  -  1  )...{m-n+  l)zn(l  +  tz)m~n. 


m  —  n 


n 


This  formula  still  holds  if  t  —  0,  so  that 

r(0)  fm\  ; 

n\  \n ) 


Now,  in  virtue  of  the  remark  made  at  the  end  of  §  164,  we  have 


0(l)  =  0(O)  +  0/(O)  + 


'(0) 


2! 


+  •••  + 


(n  —  1)! 


+  Rn 


Rn  = 


(n  —  1)! 


(1  -t)n~l^n\t)dt. 


where 
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But  if  z  —  r(cos  9  +  i  sin  6)  then 

|1  +  tz\  =  Vl  +  2tr  cos  9  +  t2r2  ^  1  —  tr, 


and  therefore 


\Rn\  < 


< 


| m(m  —  1) . . .  (m  —  n  +  1)  |  n  f1  (1  —  t) 


n — 1 


(n  —  1)! 


/ o  (i  -try 


dt 


| m(m  —  1) ...  (m  —  n  +  1)|  (1  —  9)n  1rn 


(1  —  9r)r 


(■ n  —  1)! 

where  0  <  9  <  1;  so  that  (cf.  §  163) 

|  m(m  —  1) . . .  (m  —  n  +  1)  | 


Rn  <  K  - 


(n  —  1)! 


r  =  pr 


say.  But 


Pn+l 

Pn 


m  —  n\ 

- r  — >  r, 

n 


and  so  (Ex.  xxvil.  6)  pn  — >  0,  and  therefore  Rn  — >  0,  as  n  — >  oo.  Hence 
we  arrive  at  the  following  theorem. 

Theorem.  The  sum  of  the  binomial  series 


1  + 


z2  + 


is  exp{mlog(l  +  z)},  where  the  logarithm  has  its  principal  value,  for  all 
values  of  m,  real  or  complex,  and  all  values  of  z  such  that  \z\  <  1. 

A  more  complete  discussion  of  the  binomial  series,  taking  account  of 
the  more  difficult  case  in  which  \z\  =  1,  will  be  found  on  pp.  225  et  seq.  of 
Bromwich’s  Infinite  Series. 


Examples  XCVIII.  1.  Suppose  m  real.  Then  since 

(  v  sm  0 

log(l  +  z)  =  \  log(l  +  2r  cos  6  +  r2)  +  i  arc  tan 


1  +  r  cos  9 
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we  obtain 


exp{^mlog(l  +  2r  cos0  +  r2)}  Cis  |marctan  ^ 
(1  +  2 r  cos  9  +  r2)  5m  Cis  /  m  arc  tan 


r  sin  9 
+  r  cos  9 


r  sin  9 


1  +  r  cos  9 


all  the  inverse  tangents  lying  between  —  ^tt  and  r.  In  particular,  if  we  suppose 
9  =  ^7 r,  z  =  ir,  and  equate  the  real  and  imaginary  parts,  we  obtain 


1  - 


(1  +  r2)5m 


cos(m  arc  tan  r), 


(l  +  r2)|m 


sin(m  arc  tan  r). 


2.  Verify  the  formulae  of  Ex.  1  when  m  =  1,  2,  3.  [Of  course  when  m  is  a 
positive  integer  the  series  is  finite.] 

3.  Prove  that  if  0  <  r  <  1  then 


1-3  2  ,  1  •  3  •  5  •  7  4  iVl  +  ^+1 

2  •  4?  +  2-4-6-8r  “  y  2(1  +  r2)  ’ 

1  1-3-5  o  1-3-5-7-9  ,  /  x/1  +  r2  -  1 

2 7  _  2  •  4  •  6  r  +  2  •  4  •  6  •  8  •  10  r  _  '  "  ~  \j  2(1  +  r2) 

[Take  m  =  —  ^  in  the  last  two  formulae  of  Ex.  1.] 

4.  Prove  that  if  —  ^7T  <  9  <  then 


cos  m9  =  cosm  9  {  1  — 
sin  m9  =  cosm  9 


m 


tan  9  + 


tan  9  — 


m 


tan  9  — 


tarn  9  + 


for  all  real  values  of  m.  [These  results  follow  at  once  from  the  equations 


cos  mO  +  i  sin  mO  =  (cos  9  +  i  sin  9)m  =  cosm  0(1  + i  tan  9)m .] 
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5.  We  proved  (Ex.  lxxxi.  6),  by  direct  multiplication  of  series,  that 

(TTl\ 

Jzn,  where  \z\  <  1,  satisfies  the  functional  equation 
f(m,  z)f(m',  z)  =  f(m  +  m',  z). 

Deduce,  by  an  argument  similar  to  that  of  §  216,  and  without  assuming  the 
general  result  of  p.  532,  that  if  m  is  real  and  rational  then 

f(m,  z)  =  exp{m  log(l  +  z)}. 

6.  If  z  and  )i  are  real,  and  —  1  <  z  <  1,  then 


E 


zn  =  cos{^log(l  +  z)}  +  zsin{/xlog(l  +  z)}. 


MISCELLANEOUS  EXAMPLES  ON  CHAPTER  X. 

1.  Show  that  the  real  part  of  A)g(1+d  is 

e(4fc+l)^2/8  cog{  1  (4A.  +  1)7r  bg  2}; 

where  k  is  any  integer. 

2.  If  a  cos  9  +  b  sin  6  +  c  =  0,  where  a,  b,  c  are  real  and  c2  >  a2  +  b2,  then 


9  =  mn  +  a  ±  i  log 


+  y/c2  —  a2  —  b2 


Va 2  +  b2 

where  m  is  any  odd  or  any  even  integer,  according  as  c  is  positive  or  negative, 
and  a  is  an  angle  whose  cosine  and  sine  are  a/  \J a2  +  b2  and  6/  yj a2  +  b2 . 

3.  Prove  that  if  9  is  real  and  sin^siniji  =  1  then 

(f>  =  (k  +  ±  i  log  cot  \  {kTt  +  9), 

where  k  is  any  even  or  any  odd  integer,  according  as  sin#  is  positive  or  negative. 

4.  Show  that  if  x  is  real  then 

—  exp{(a  +  ib)x}  =  (a  +  ib)  exp{(a  +  ib)x}, 
ax 


exp{(a  +  ib)x}  dx  = 


exp  (a  +  ib)x 
a  +  ib 
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Deduce  the  results  of  Ex.  lxxxvii.  3. 

°  1 

5.  Show  that  if  a  >  0  then  /  exp{  —  (a  +  ib)x}  dx  =  - — ,  and  deduce 

Jo  a  +  ib 

the  results  of  Ex.  LXXXVII.  5. 

6.  Show  that  if  ( x/a )2  +  ( y/b )2  =  1  is  the  equation  of  an  ellipse,  and  f(x,  y) 
denotes  the  terms  of  highest  degree  in  the  equation  of  any  other  algebraic  curve, 
then  the  sum  of  the  eccentric  angles  of  the  points  of  intersection  of  the  ellipse 
and  the  curve  differs  by  a  multiple  of  27t  from 


-i{log  /(a,  ib)  -  log  /(a,  -ib)}. 

[The  eccentric  angles  are  given  by  f(a  cos  a,  b  sin  a)  +  •  •  •  =  0  or  by 

where  u  =  exp  ia\  and  a  is  equal  to  one  of  the  values  of  —i  Log  P,  where  P  is 
the  product  of  the  roots  of  this  equation.] 

7.  Determine  the  number  and  approximate  positions  of  the  roots  of  the 
equation  tan  2  =  az,  where  a  is  real. 

[We  know  already  (Ex.  xvn.  4)  that  the  equation  has  infinitely  many  real 
roots.  Now  let  z  =  x  +  iy,  and  equate  real  and  imaginary  parts.  We  obtain 

sin  2x/(cos  2x  +  cosh  2  y)  =  ax,  sinh  2  y/ (cos  2x  +  cosh  2  y)  =  ay, 

so  that,  unless  x  or  y  is  zero,  we  have 


(sin2x)/2x  =  (sinh2y)/2y. 

This  is  impossible,  the  left-hand  side  being  numerically  less,  and  the  right-hand 
side  numerically  greater  than  unity.  Thus  x  =  0  or  y  =  0.  Ify  =  0we  come 
back  to  the  real  roots  of  the  equation.  If  x  =  (1  then  tanh  y  =  ay.  It  is  easy  to 
see  that  this  equation  has  no  real  root  other  than  zero  if  a  ^  0  or  a  ^  1,  and  two 
such  roots  if  0  <  a  <  1.  Thus  there  are  two  purely  imaginary  roots  if  0  <  a  <  1; 
otherwise  all  the  roots  are  real.] 

8.  The  equation  tan  z  =  az  +  b,  where  a  and  b  are  real  and  b  is  not  equal 
to  zero,  has  no  complex  roots  if  a  ^  0.  If  a  >  0  then  the  real  parts  of  all  the 
complex  roots  are  numerically  greater  than  |6/2a|. 
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9.  The  equation  tan  z  =  a/z,  where  a  is  real,  has  no  complex  roots,  but 
has  two  purely  imaginary  roots  if  a  <  0. 

10.  The  equation  tan  z  =  atanhcz,  where  a  and  c  are  real,  has  an  infinity 
of  real  and  of  purely  imaginary  roots,  but  no  complex  roots. 

1 1 .  Show  that  if  x  is  real  then 

e-cos6x  =  t^{«»-(")a"-V+Q 


where  there  are  ^  (n  +  1)  or  |(n  +  2)  terms  inside  the  large  brackets.  Find  a 
similar  series  for  eax  sin  bx. 

12.  If  ruf>(z,  n)  — >•  z  as  n  — >  oo,  then  {1  +  cf>(z ,  n)}n  — >  exp  z. 

13.  If  cf>(t)  is  a  complex  function  of  the  real  variable  t,  then 


d 

dt 


log  <J>(t) 


m 

<Kt) ' 


[Use  the  formulae 

4 >  =  ^  +  *X ,  log  4>  =  \  l°g(V;2  +  X2)  +  *  arctan(x/,0).] 


14.  Transformations.  In  Ch.  Ill  (Exs.  XXI.  21  et  seq.,  and  Misc.  Exs.  22  et 
seq.)  we  considered  some  simple  examples  of  the  geometrical  relations  between 
figures  in  the  planes  of  two  variables  z,  Z  connected  by  a  relation  z  =  f(Z). 
We  shall  now  consider  some  cases  in  which  the  relation  involves  logarithmic, 
exponential,  or  circular  functions. 

Suppose  firstly  that 

z  =  exp(7rZ/a),  Z  =  (a/ir)  Log  z 

where  a  is  positive.  To  one  value  of  Z  corresponds  one  of  z,  but  to  one  of  z 
infinitely  many  of  Z.  If  x,  y ,  r,  9  are  the  coordinates  of  z  and  X,  Y,  R,  0  those 
of  Z,  we  have  the  relations 

x  =  e7rA,/a  cos(7rU/a),  y  =  enX^a  sin(7ry/a), 

X  =  (a/ir)  logr,  Y  =  (a9/ir)  +  2ka, 

where  k  is  any  integer.  If  we  suppose  that  —  n  <  9  ^  n,  and  that  Log  z  has 
its  principal  value  logz,  then  k  =  0,  and  Z  is  confined  to  a  strip  of  its  plane 
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parallel  to  the  axis  OX  and  extending  to  a  distance  a  from  it  on  each  side,  one 
point  of  this  strip  corresponding  to  one  of  the  whole  2-plane,  and  conversely. 
By  taking  a  value  of  Log  z  other  than  the  principal  value  we  obtain  a  similar 
relation  between  the  2-plane  and  another  strip  of  breadth  2a  in  the  Z-plane. 

To  the  lines  in  the  Z-plane  for  which  X  and  Y  are  constant  correspond  the 
circles  and  radii  vectores  in  the  2-plane  for  which  r  and  0  are  constant.  To  one 
of  the  latter  lines  corresponds  the  whole  of  a  parallel  to  OX,  but  to  a  circle  for 
which  r  is  constant  corresponds  only  a  part,  of  length  2 a,  of  a  parallel  to  OY.  To 
make  Z  describe  the  whole  of  the  latter  line  we  must  make  2  move  continually 
round  and  round  the  circle. 

15.  Show  that  to  a  straight  line  in  the  Z-plane  corresponds  an  equiangular 
spiral  in  the  2-plane. 

16.  Discuss  similarly  the  transformation  2  =  c  cosh(7r Z / a) ,  showing  in  par¬ 
ticular  that  the  whole  2-plane  corresponds  to  any  one  of  an  infinite  number  of 
strips  in  the  Z-plane,  each  parallel  to  the  axis  OX  and  of  breadth  2a.  Show  also 
that  to  the  line  X  =  Xq  corresponds  the  ellipse 

x  \2  /  y 

c  cosh(7rXo /a)  /  \c  sinh(7rXo / a) 

and  that  for  different  values  of  Xo  these  ellipses  form  a  confocal  system;  and  that 
the  lines  Y  =  Yq  correspond  to  the  associated  system  of  confocal  hyperbolas. 
Trace  the  variation  of  2  as  Z  describes  the  whole  of  a  line  X  =  Xq  or  Y  =  Yq. 
How  does  Z  vary  as  2  describes  the  degenerate  ellipse  and  hyperbola  formed  by 
the  segment  between  the  foci  of  the  confocal  system  and  the  remaining  segments 
of  the  axis  of  x? 

17.  Verify  that  the  results  of  Ex.  16  are  in  agreement  with  those  of  Ex.  14 
and  those  of  Ch.  Ill,  Misc.  Ex.  25.  [The  transformation  2  =  ccosh(7rZ/a)  may 
be  regarded  as  compounded  from  the  transformations 

2  =  C2i,  21  =  \{z2  +  (I/22)},  22  =  exp(7rZ/a).] 

18.  Discuss  similarly  the  transformation  2  =  ctanh(-7rZ/a),  showing  that  to 
the  lines  X  =  Xq  correspond  the  coaxal  circles 

{x  —  ccoth(27rA'o/a)}2  +  y2  =  c2  cosech2(27rA'o/a), 
and  to  the  lines  Y  =  Yq  the  orthogonal  system  of  coaxal  circles. 
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19.  The  Stereographic  and  Mercator’s  Projections.  The  points  of  a 
unit  sphere  whose  centre  is  the  origin  are  projected  from  the  south  pole  (whose 
coordinates  are  0,  0,  —1)  on  to  the  tangent  plane  at  the  north  pole.  The  coordi¬ 
nates  of  a  point  on  the  sphere  are  £,  r/,  (,  and  Cartesian  axes  OX,  OY  are  taken 
on  the  tangent  plane,  parallel  to  the  axes  of  £  and  y.  Show  that  the  coordinates 
of  the  projection  of  the  point  are 


x  =  2£/(l  +  C),  y  =  2r7/(l  +  C), 

and  that  x  +  iy  =  2  tan  |$Cis</>,  where  </>  is  the  longitude  (measured  from  the 
plane  7]  =  0)  and  0  the  north  polar  distance  of  the  point  on  the  sphere. 

This  projection  gives  a  map  of  the  sphere  on  the  tangent  plane,  generally 
known  as  the  Stereographic  Projection.  If  now  we  introduce  a  new  complex 
variable 

Z  =  X  +  iY  =  —  i  log  \z  =  —  i  log  \  (x  +  iy) 

so  that  X  =  <j),Y  =  log  cot  \0,  we  obtain  another  map  in  the  plane  of  Z ,  usually 
called  Mercator’s  Projection.  In  this  map  parallels  of  latitude  and  longitude  are 
represented  by  straight  lines  parallel  to  the  axes  of  X  and  Y  respectively. 

20.  Discuss  the  transformation  given  by  the  equation 


z  =  Log 


Z  —  a 
Z-b 


showing  that  the  straight  lines  for  which  x  and  y  are  constant  correspond  to  two 
orthogonal  systems  of  coaxal  circles  in  the  Z-plane. 

21.  Discuss  the  transformation 


z  =  Log 


V7 Z  —  a  +  \J  Z  —  b 
\Jb  —  a 


showing  that  the  straight  lines  for  which  x  and  y  are  constant  correspond  to  sets 
of  confocal  ellipses  and  hyperbolas  whose  foci  are  the  points  Z  =  a  and  Z  =  b. 
[We  have 


V Z  —  a  +  V Z  —  b  =  Vb  —  a  exp(  x  +  iy), 
V Z  —  a  —  V Z  —  b  =  Vb  —  a  exp(— x  —  iy); 


and  it  will  be  found  that 


Z  —  a\  +  \Z  —  b\  =  \b  —  a\  cosh2.x,  \Z  —  a\  —  \Z  —  b\  =  \b  —  a\  cos2y.] 


[X :  237] 


EXPONENTIAL,  AND  CIRCULAR  FUNCTIONS 


539 


22.  The  transformation  z  =  Zl.  If  z  =  Zl .  where  the  imaginary  power 
has  its  principal  value,  we  have 

exp(log  r  +  id)  =  z  =  exp(z  log  Z)  =  exp(i  log  R  —  0) , 

so  that  logr  =  —0,  6  =  logi?  +  2/c7r,  where  k  is  an  integer.  As  all  values  of  k 
give  the  same  point  z,  we  shall  suppose  that  k  =  0,  so  that 

logr  =  —  0,  9  =  log  R.  (1) 

The  whole  plane  of  Z  is  covered  when  R  varies  through  all  positive  values 
and  ©  from  —  ir  to  n:  then  r  has  the  range  exp(— 7r)  to  exp7r  and  6  ranges 
through  all  real  values.  Thus  the  Z-plane  corresponds  to  the  ring  bounded  by 
the  circles  r  =  exp(— n),  r  =  exp7r;  but  this  ring  is  covered  infinitely  often.  If 
however  9  is  allowed  to  vary  only  between  —  it  and  ^ r,  so  that  the  ring  is  covered 
only  once,  then  R  can  vary  only  from  exp(— n)  to  exp7r,  so  that  the  variation 
of  Z  is  restricted  to  a  ring  similar  in  all  respects  to  that  within  which  z  varies. 
Each  ring,  moreover,  must  be  regarded  as  having  a  barrier  along  the  negative 
real  axis  which  z  (or  Z)  must  not  cross,  as  its  amplitude  must  not  transgress 
the  limits  —  ir  and  7 r. 

We  thus  obtain  a  correspondence  between  two  rings,  given  by  the  pair  of 
equations 

z  =  Z\  Z  =  z~\ 

where  each  power  has  its  principal  value.  To  circles  whose  centre  is  the  origin 
in  one  plane  correspond  straight  lines  through  the  origin  in  the  other. 

23.  Trace  the  variation  of  z  when  Z .  starting  at  the  point  exp  tt,  moves  round 
the  larger  circle  in  the  positive  direction  to  the  point  —  exp  7 r,  along  the  barrier, 
round  the  smaller  circle  in  the  negative  direction,  back  along  the  barrier,  and 
round  the  remainder  of  the  larger  circle  to  its  original  position. 

24.  Suppose  each  plane  to  be  divided  up  into  an  infinite  series  of  rings  by 
circles  of  radii 


-(2n+l)7T  7r  7 r  3tt  (2n+l)7r 

*  *  *  1  0  °  5  °  5  °  1  *  *  *  1  0  5  *  *  *  * 

Show  how  to  make  any  ring  in  one  plane  correspond  to  any  ring  in  the  other, 
by  taking  suitable  values  of  the  powers  in  the  equations  z  =  Zl ,  Z  =  z~l . 
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25.  If  z  =  Zl,  any  value  of  the  power  being  taken,  and  Z  moves  along  an 
equiangular  spiral  whose  pole  is  the  origin  in  its  plane,  then  z  moves  along  an 
equiangular  spiral  whose  pole  is  the  origin  in  its  plane. 

26.  How  does  Z  =  zai,  where  a  is  real,  behave  as  z  approaches  the  origin 
along  the  real  axis?  [Z  moves  round  and  round  a  circle  whose  centre  is  the  origin 
(the  unit  circle  if  zai  has  its  principal  value),  and  the  real  and  imaginary  parts 
of  Z  both  oscillate  finitely.] 

27.  Discuss  the  same  question  for  Z  =  za+bi .  where  a  and  b  are  any  real 
numbers. 

00 

28.  Show  that  the  region  of  convergence  of  a  series  of  the  type  anznai, 

—  OO 

where  a  is  real,  is  an  angle,  i.e.  a  region  bounded  by  inequalities  of  the  type 
9q  <  amz  <  6\  [The  angle  may  reduce  to  a  line,  or  cover  the  whole  plane.] 

29.  Level  Curves.  If  f(z)  is  a  function  of  the  complex  variable  z,  we  call 
the  curves  for  which  |/(^)|  is  constant  the  level  curves  of  f(z).  Sketch  the  forms 
of  the  level  curves  of 

z  —  a  (concentric  circles),  ( z  —  a)(z  —  b )  (Cartesian  ovals), 

(z  —  a)/(z  —  b)  (coaxal  circles),  exp z  (straight  lines). 

30.  Sketch  the  forms  of  the  level  curves  of  (z—a)(z—b)(z—c),  (l+z\/3+z2) / z. 
[Some  of  the  level  curves  of  the  latter  function  are  drawn  in  Fig.  59,  the  curves 
marked  i-vii  corresponding  to  the  values 

.10,  2  —  v/3  =  .27,  .40,  1.00,  2.00,  2  +  \/3  =  3.73,  4.53 

of  \f(z)\.  The  reader  will  probably  find  but  little  difficulty  in  arriving  at  a 
general  idea  of  the  forms  of  the  level  curves  of  any  given  rational  function;  but 
to  enter  into  details  would  carry  us  into  the  general  theory  of  functions  of  a 
complex  variable.] 

31.  Sketch  the  forms  of  the  level  curves  of  (i)  2  exp  z,  (ii)  sin  z.  [See  Fig.  60, 
which  represents  the  level  curves  of  sinz.  The  curves  marked  i-viil  correspond 
to  k  =  .35,  .50,  .71,  1.00,  1.41,  2.00,  2.83,  4.00.] 

32.  Sketch  the  forms  of  the  level  curves  of  exp  z—c,  where  c  is  a  real  constant. 
[Fig.  61  shows  the  level  curves  of  |  exp  2  —  1|,  the  curves  I-VII  corresponding  to 
the  values  of  k  given  by  logfc  =  —1.00,  —.20,  —.05,  0.00,  .05,  .20,  1.00.] 
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Fig.  59. 


33.  The  level  curves  of  sin  2  —  c,  where  c  is  a  positive  constant,  are  sketched 
in  Figs.  62,  63.  [The  nature  of  the  curves  differs  according  as  to  whether  c  <  1 
or  c  >  1.  In  Fig.  62  we  have  taken  c  =  .5,  and  the  curves  i-vm  correspond  to 


[X  :  237]  THE  GENERAL  THEORY  OF  THE  LOGARITHMIC, 


542 


Fig.  60. 


Fig.  61. 


k  =  .29,  .37,  .50,  .87,  1.50,  2.60,  4.50,  7.79.  In  Fig.  63  we  have  taken  c  =  2, 
and  the  curves  I-VII  correspond  to  k  =  .58,  1.00,  1.73,  3.00,  5.20,  9.00,  15.59.  If 
c  =  1  then  the  curves  are  the  same  as  those  of  Fig.  60,  except  that  the  origin 
and  scale  are  different.] 


Fig.  62. 


Fig.  63. 
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34.  Prove  that  if  0  <  9  <  ir  then 

cos  9  +  |  cos  39  +  g  cos  5 9  +  •  ■  ■  =  \  log  cot2  4$, 
sin  9  +  4  gin  39  +  4  sin  59  +  •  •  •  = 

and  determine  the  sums  of  the  series  for  all  other  values  of  9  for  which  they  are 
convergent.  [Use  the  equation 

2  +  |2s  +  ^  +  -.  =  iiog(ii|) 

where  z  =  cos  9  +  i  sin  9.  When  9  is  increased  by  ir  the  sum  of  each  series  simply 
changes  its  sign.  It  follows  that  the  first  formula  holds  for  all  values  of  9  save 
multiples  of  ir  (for  which  the  series  diverges),  while  the  sum  of  the  second  series 
is  r  if  2kir  <  9  <  (2k  +  l)7r,  —  f  7r  if  (2k  +  l)7r  <  9  <  (2k  +  2)tt ,  and  0  if  9  is  a 
multiple  of  tx.\ 

35.  Prove  that  if  0  <  9  <  r  then 

cos  9  —  4  cos  39  +  4  Cos  5 9  —  ■  ■  ■  =  j7r, 

sin  9  —  |  sin  39  +  g  sin  59  —  ■  ■  ■  =  \  log(sec  9  +  tan  #)2; 

and  determine  the  sums  of  the  series  for  all  other  values  of  9  for  which  they  are 
convergent. 

36.  Prove  that 


cos  9  cos  a  +  j  cos  29  cos  2a  +  4  Cos  39  cos  3a  +  •  •  •  =  —  \  log{4(cos  9  —  cos  a)2}, 


unless  9  —  a  or  9  +  a  is  a  multiple  of  27r. 

37.  Prove  that  if  neither  a  nor  b  is  real  then 


dx 


l0  ( x-a)(x-b ) 


log(-q)  -  log (—6) 
a  —  b 


each  logarithm  having  its  principal  value.  Verify  the  result  when  a  =  ci,  b  =  —  ci, 
where  c  is  positive.  Discuss  also  the  cases  in  which  a  or  b  or  both  are  real  and 
negative. 

38.  Prove  that  if  a  and  /3  are  real,  and  (3  >  0,  then 


x2 


_ 

(a  +  i/3)2 


iri 

2 (a  +  i/3) ' 


o 
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What  is  the  value  of  the  integral  when  (5  <  0? 

39.  Prove  that,  if  the  roots  of  Ax2  +  2Bx  +  C  =  0  have  their  imaginary  parts 
of  opposite  signs,  then 

dx  7 Tl 

Ax2  +  2 Bx  +  C  ~  s/B 2  -  AC' 

the  sign  of  B2  —  AC  being  so  chosen  that  the  real  part  of  {  V B 2  —  AC } /Ai  is 
positive. 
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(To  Chapters  III,  IV,  V) 

The  Proof  that  every  Equation  has  a  Root 

Let 

Z  =  P(z)  =  a0zn  +  aii  zn~l  +  •  •  •  +  an 

be  a  polynomial  in  2,  with  real  or  complex  coefficients.  We  can  represent 
the  values  of  z  and  Z  by  points  in  two  planes,  which  we  may  call  the  2- 
plane  and  the  Z-plane  respectively.  It  is  evident  that  if  2  describes  a  closed 
path  7  in  the  2-plane,  then  Z  describes  a  corresponding  closed  path  T  in 
the  Z-plane.  We  shall  assume  for  the  present  that  the  path  T  does  not 
pass  through  the  origin. 

To  any  value  of  Z  correspond  an  infinity  of  values  of  am  Z,  differing 
by  multiples  of  2n,  and  each  of  these  values  varies  continuously  as  Z  de¬ 
scribes  T.*  We  can  select  a  particular  value  of  am  Z  corresponding  to  each 
point  of  T,  by  first  selecting  a  particular  value  corresponding  to  the  initial 


Fig.  B. 


value  of  Z,  and  then  following  the  continuous  variation  of  this  value  as 
Z  moves  along  T.  We  shall,  in  the  argument  which  follows,  use  the  phrase 

*It  is  here  that  we  assume  that  F  does  not  pass  through  the  origin. 
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‘the  amplitude  of  Z’  and  the  formula  am  Z  to  denote  the  particular  value 
of  the  amplitude  of  Z  thus  selected.  Thus  am  Z  denotes  a  one-valued  and 
continuous  function  of  X  and  Y,  the  real  and  imaginary  parts  of  Z. 

When  Z,  after  describing  T,  returns  to  its  original  position,  its  ampli¬ 
tude  may  be  the  same  as  before,  as  will  certainly  be  the  case  if  T  does 
not  enclose  the  origin,  like  path  (a)  in  Fig.  B,  or  it  may  differ  from  its 
original  value  by  any  multiple  of  2tc.  Thus  if  its  path  is  like  ( b )  in  Fig.  B, 
winding  once  round  the  origin  in  the  positive  direction,  then  its  amplitude 
will  have  increased  by  2n.  These  remarks  apply,  not  merely  to  T,  but  to 
any  closed  contour  in  the  Z-plane  which  does  not  pass  through  the  origin. 
Associated  with  any  such  contour  there  is  a  number  which  we  may  call  ‘the 
increment  of  amZ  when  Z  describes  the  contour’,  a  number  independent 
of  the  initial  choice  of  a  particular  value  of  the  amplitude  of  Z. 

We  shall  now  prove  that  if  the  amplitude  of  Z  is  not  the  same  when 
Z  returns  to  its  original  position,  then  the  path  of  z  must  contain  inside 
or  on  it  at  least  one  point  at  which  Z  =  0. 

We  can  divide  7  into  a  number  of  smaller  contours  by  drawing  parallels 
to  the  axes  at  a  distance  ch  from  one  another,  as  in  Fig.  C.*  If  there  is,  on 
the  boundary  of  any  one  of  these  contours,  a  point  at  which  Z  =  0,  what 
we  wish  to  prove  is  already  established.  We  may  therefore  suppose  that 
this  is  not  the  case.  Then  the  increment  of  am  Z,  when  z  describes  7,  is 
equal  to  the  sum  of  all  the  increments  of  am  Z  obtained  by  supposing  2  to 
describe  each  of  these  smaller  contours  separately  in  the  same  sense  as  7. 
For  if  z  describes  each  of  the  smaller  contours  in  turn,  in  the  same  sense, 
it  will  ultimately  (see  Fig.  D)  have  described  the  boundary  of  7  once,  and 
each  part  of  each  of  the  dividing  parallels  twice  and  in  opposite  directions. 
Thus  PQ  will  have  been  described  twice,  once  from  P  to  Q  and  once  from 
Q  to  P.  As  z  moves  from  P  to  Q,  am  Z  varies  continuously,  since  Z  does 
not  pass  through  the  origin;  and  if  the  increment  of  am  Z  is  in  this  case  9, 
then  its  increment  when  z  moves  from  Q  to  P  is  —  9]  so  that,  when  we 
add  up  the  increments  of  am  Z  due  to  the  description  of  the  various  parts 

*  There  is  no  difficulty  in  giving  a  definite  rule  for  the  construction  of  these  parallels: 
the  most  obvious  course  is  to  draw  all  the  lines  x  =  Mi,  y  =  Mi,  where  k  is  an  integer 
positive  or  negative. 
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Fig.  C. 


Fig.  D. 


of  the  smaller  contours,  all  cancel  one  another,  save  the  increments  due  to 
the  description  of  parts  of  7  itself. 

Hence,  if  am  Z  is  changed  when  z  describes  7,  there  must  be  at  least 
one  of  the  smaller  contours,  say  71,  such  that  am  Z  is  changed  when  z  de¬ 
scribes  71 .  This  contour  may  be  a  square  whose  sides  are  parts  of  the 
auxiliary  parallels,  or  may  be  composed  of  parts  of  these  parallels  and 
parts  of  the  boundary  of  7.  In  any  case  every  point  of  the  contour  lies  in 
or  on  the  boundary  of  a  square  Ai  whose  sides  are  parts  of  the  auxiliary 
parallels  and  of  length  A . 

We  can  now  further  subdivide  71  by  the  help  of  parallels  to  the  axes 
at  a  smaller  distance  52  from  one  another,  and  we  can  find  a  contour  72, 
entirely  included  in  a  square  A2,  of  side  $2  and  itself  included  in  Ai  such 
that  am  Z  is  changed  when  z  describes  the  contour. 

Now  let  us  take  an  infinite  sequence  of  decreasing  numbers  <5i,  52,  ■  ■  ■ , 
Sm,  . . . ,  whose  limit  is  zero.*  By  repeating  the  argument  used  above,  we 
can  determine  a  series  of  squares  Ai,  A2,  . . . ,  Am,  . . .  and  a  series  of 
contours  71,  72,  . . . ,  jm,  . . .  such  that  (i)  A.m+i  lies  entirely  inside  Am, 

*We  may,  e.g.,  take  Sm  =  <5i/2m_1. 
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(ii)  7m  lies  entirely  inside  Am,  (iii)  am  Z  is  changed  when  z  describes  7 m. 

If  ( xm,ym )  and  {xrn  +  5m,ym  +  Sm)  are  the  lower  left-hand  and  upper 
right-hand  corners  of  Am,  it  is  clear  that  xi,  X2,  . . , ,  xm,  . . .  is  an  increasing 
and  x\  +  hi,  X2  +  82,  . . . ,  xm  +  8m,  ...  a  decreasing  sequence,  and  that  they 
have  a  common  limit  xq.  Similarly  yrn  and  ym  +  Sm  have  a  common  limit  y0, 
and  (x0,y0)  is  the  one  and  only  point  situated  inside  every  square  Am. 
However  small  5  may  be,  we  can  draw  a  square  which  includes  (x0,y0), 
and  whose  sides  are  parallel  to  the  axes  and  of  length  8,  and  inside  this 
square  a  closed  contour  such  that  am  Z  is  changed  when  z  describes  the 
contour. 

It  can  now  be  shown  that 

P(x0  +  iy0 )  =  0. 

For  suppose  that  P(x 0  +  iyo)  =  a,  where  |a|  =  p  >  0.  Since  P(x  +  iy)  is 
a  continuous  function  of  x  and  y,  we  can  draw  a  square  whose  centre 
is  (xo,yo)  and  whose  sides  are  parallel  to  the  axes,  and  which  is  such  that 

I P(x  +  iy)  -  P(x0  +  *2/0 )  |  <  \p 

at  all  points  x  +  iy  inside  the  square  or  on  its  boundary.  At  all  such  points 

P(x  +  iy)  =  a  +  <j), 

where  \(j)\  <  \p.  Now  let  us  take  any  closed  contour  lying  entirely  inside 
this  square.  As  z  describes  this  contour,  Z  =  a  +  0  also  describes  a  closed 
contour.  But  the  latter  contour  evidently  lies  inside  the  circle  whose  centre 
is  a  and  whose  radius  is  \p,  and  this  circle  does  not  include  the  origin. 
Hence  the  amplitude  of  Z  is  unchanged. 

But  this  contradicts  what  was  proved  above,  viz.  that  inside  each 
square  Am  we  can  find  a  closed  contour  the  description  of  which  by  z 
changes  am  Z .  Hence  P{x 0  +  iyo)  =  0. 

All  that  remains  is  to  show  that  we  can  always  find  some  contour  such 
that  am  Z  is  changed  when  z  describes  7.  Now 


+ 


a>2 

a0z2 


a0zn)  ' 


Z  =  a0zn 


+  •••  + 
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We  can  choose  R  so  that 

|®1 1  |®2 1  |  |  ^  r 

\a0\R  +  \a0\R2  +"'+  \a0\R" 

where  6  is  any  positive  number,  however  small;  and  then,  if  7  is  the  circle 
whose  centre  is  the  origin  and  whose  radius  is  R ,  we  have 

Z  =  aoZn(l  +  p), 

where  |p|  <  5,  at  all  points  on  7.  We  can  then  show,  by  an  argument 
similar  to  that  used  above,  that  am(l  +  p)  is  unchanged  as  z  describes 
7  in  the  positive  sense,  while  am  zn  on  the  other  hand  is  increased  by  2mr. 
Hence  am  Z  is  increased  by  2n7r,  and  the  proof  that  Z  =  0  has  a  root  is 
completed. 

We  have  assumed  throughout  the  argument  that  neither  T,  nor  any  of 
the  smaller  contours  into  which  it  is  resolved,  passes  through  the  origin. 
This  assumption  is  obviously  legitimate,  for  to  suppose  the  contrary,  at 
any  stage  of  the  argument,  is  to  admit  the  truth  of  the  theorem. 

We  leave  it  as  an  exercise  to  the  reader  to  infer,  from  the  discussion 
which  precedes  and  that  of  §  43,  that  when  z  describes  any  contour  7  in 
the  positive  sense  the  increment  of  am  Z  is  2k7i,  where  k  is  the  number  of 
roots  of  Z  =  0  inside  7,  multiple  roots  being  counted  multiply. 

There  is  another  proof,  proceeding  on  different  lines,  which  is  often 
given.  It  depends,  however,  on  an  extension  to  functions  of  two  or  more 
variables  of  the  results  of  §§  102  et  seq. 

We  define,  precisely  on  the  lines  of  §  102,  the  upper  and  lower  bounds 
of  a  function  f(x,  y),  for  all  pairs  of  values  of  x  and  y  corresponding  to  any 
point  of  any  region  in  the  plane  of  (x,  y)  bounded  by  a  closed  curve.  And 
we  can  prove,  much  as  in  §  102,  that  a  continuous  function  f(x,y)  attains 
its  upper  and  lower  bounds  in  any  such  region. 

Now 

\Z\  =  \P(x  +  iy)\ 

is  a  positive  and  continuous  function  of  x  and  y.  If  m  is  its  lower  bound  for 
points  on  and  inside  7,  then  there  must  be  a  point  Zq  for  which  \Z\  =  m, 
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and  this  must  be  the  least  value  assumed  by  \Z\.  If  m  —  0,  then  P(z0)  =  0, 
and  we  have  proved  what  we  want.  We  may  therefore  suppose  that  m  >  0. 

The  point  Zq  must  lie  either  inside  or  on  the  boundary  of  7:  but  if  7  is  a 
circle  whose  centre  is  the  origin,  and  whose  radius  R  is  large  enough,  then 
the  last  hypothesis  is  untenable,  since  \P(z)\  — »  00  as  \z\  — »  00.  We  may 
therefore  suppose  that  z0  lies  inside  7. 

If  we  put  z  =  z0  +  C,  and  rearrange  P(z)  according  to  powers  of  (,  we 
obtain 

P(z)  =  P(z0)  +  AiC,  +  A2(2  +  •  •  •  +  An(n, 

say.  Let  A k  be  the  hrst  of  the  coefficients  which  does  not  vanish,  and  let 
|^4fc|  =  /U,  |C|  =  P-  We  can  choose  p  so  small  that 

|dlfc+i|p  +  \Ak+2\p2  +  •  •  •  +  \An\pn  k  <  ^/i. 


Then 

\P(z)  ~P{zo)  -  Ak(k\  <  \ppk , 

and 

\P(z)\  <  \P(z0  +  Ak(k\  +  \ppk. 

Now  suppose  that  z  moves  round  the  circle  whose  centre  is  z0  and 
radius  p.  Then 

P(z0)  +  Ak(k 

moves  k  times  round  the  circle  whose  centre  is  P{zq)  and  radius  = 

ppk:  and  passes  k  times  through  the  point  in  which  this  circle  is  intersected 
by  the  line  joining  P(zq)  to  the  origin.  Hence  there  are  k  points  on  the 
circle  described  by  z  at  which  | P{zq)  +  Ak(k\  =  |P(z0)|  —  ppk  and  so 

\P(z)\  <  \P(z0)\  -  ppk  +  \ppk  —m  —  \ppk  <  m; 

and  this  contradicts  the  hypothesis  that  m  is  the  lower  bound  of  \P(z)\. 

It  follows  that  m  must  be  zero  and  that  P(zq)  =  0. 
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EXAMPLES  ON  APPENDIX  I 

1.  Show  that  the  number  of  roots  of  f(z )  =  0  which  lie  within  a  closed 
contour  which  does  not  pass  through  any  root  is  equal  to  the  increment  of 


{log  f{z)}/2m 


when  z  describes  the  contour. 

2.  Show  that  if  R  is  any  number  such  that 


N  ,  N  ,  .  Kl  1 

R  R2  Rn  ’ 


then  all  the  roots  of  zn  +  a\zn~l  +  ■  ■  ■  +  an  =  0  are  in  absolute  value  less  than  R. 
In  particular  show  that  all  the  roots  of  z 5  —  13 z  —  7  =  0  are  in  absolute  value 
less  than  2^. 

3.  Determine  the  numbers  of  the  roots  of  the  equation  z2p +az+b  =  0  where 
a  and  b  are  real  and  p  odd,  which  have  their  real  parts  positive  and  negative. 
Show  that  if  a  >  0,  b  >  0  then  the  numbers  are  p  —  1  and  p  +  1;  if  a  <  0,  b  >  0 
they  are  p  +  1  and  p  —  1 ;  and  if  b  <  0  they  are  p  and  p.  Discuss  the  particular 
cases  in  which  a  =  0  or  b  =  0.  Verify  the  results  when  p  =  1. 

[Trace  the  variation  of  am  (z2p  +  az  +  b)  as  z  describes  the  contour  formed 
by  a  large  semicircle  whose  centre  is  the  origin  and  whose  radius  is  R,  and  the 
part  of  the  imaginary  axis  intercepted  by  the  semicircle.] 

4.  Consider  similarly  the  equations 


z4q  +  az  +  b  =  0,  zAq~l  +az  +  b  =  0,  z4q+1  +az  +  b  =  0. 


5.  Show  that  if  a  and  (3  are  real  then  the  numbers  of  the  roots  of  the 
equation  z2n + a2 z2n^1  +  =  0  which  have  their  real  parts  positive  and  negative 
are  n  —  1  and  n  +  1.  or  n  and  n,  according  as  n  is  odd  or  even. 

(Math.  Trip.  1891.) 

6.  Show  that  when  z  moves  along  the  straight  line  joining  the  points  z  =  zi, 
z  =  z 2,  from  a  point  near  z\  to  a  point  near  Z2,  the  increment  of 

'  1  +-M 


am 


z  —  Z\ 


Z  -  Z2 
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is  nearly  equal  to  ir. 

7.  A  contour  enclosing  the  three  points  z  =  z\ ,  z  =  ,  z  =  z3  is  defined  by 

parts  of  the  sides  of  the  triangle  formed  by  z  1,  z 2,  z3,  and  the  parts  exterior  to 
the  triangle  of  three  small  circles  with  their  centres  at  those  points.  Show  that 
when  z  describes  the  contour  the  increment  of 

(  1  1  1 

am - 1 - 1 - 

V  z  -  Z\  Z  -  Z2  z  ~  Z3 


is  equal  to  — 27T. 

8.  Prove  that  a  closed  oval  path  which  surrounds  all  the  roots  of  a  cubic 
equation  f(z)  =  0  also  surrounds  those  of  the  derived  equation  f'(z)  =  0.  [Use 
the  equation 


/'(*)  =  /(*) 


1  1  1 

- 1 - 1 - 

Z  —  Z\  Z  —  Z2  z  —  Z3 


where  z\,  Z2,  z3  are  the  roots  of  f(z)  =  0,  and  the  result  of  Ex.  7.] 

9.  Show  that  the  roots  of  f'(z)  =  0  are  the  foci  of  the  ellipse  which  touches 
the  sides  of  the  triangle  (21,22,23)  at  their  middle  points.  [For  a  proof  see 
Cesaro’s  Elementares  Lehrbuch  der  algebraischen  Analysis,  p.  352.] 

10.  Extend  the  result  of  Ex.  8  to  equations  of  any  degree. 

11.  If  f{z)  and  (f>(z)  are  two  polynomials  in  z,  and  7  is  a  contour  which  does 
not  pass  through  any  root  of  f(z),  and  \(j){z) \  <  |/(2)|  at  all  points  on  7,  then 
the  numbers  of  the  roots  of  the  equations 


f(z)  =  0,  f{z)+(j)(z)  =  0 


which  lie  inside  7  are  the  same. 

12.  Show  that  the  equations 

ez  =  az,  ez  =  az2 ,  ez  =  az3, 


where  a  >  e,  have  respectively  (i)  one  positive  root  (ii)  one  positive  and  one 
negative  root  and  (iii)  one  positive  and  two  complex  roots  within  the  circle 
|z|  =  1.  (Math.  Trip.  1910.) 
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(To  Chapters  IX,  X) 

A  Note  on  Double  Limit  Problems 

In  the  course  of  Chapters  IX  and  X  we  came  on  several  occasions  into 
contact  with  problems  of  a  kind  which  invariably  puzzle  beginners  and 
are  indeed,  when  treated  in  their  most  general  forms,  problems  of  great 
difficulty  and  of  the  utmost  interest  and  importance  in  higher  mathematics. 

Let  us  consider  some  special  instances.  In  §  213  we  proved  that 

log(l  +  x)  =  x  -  \x2  +  lx3  -  , 

where  —  1  <  x  ^  1,  by  integrating  the  equation 

1/(1  +  f)  =  1  —  t  +  t2  —  ... 


between  the  limits  0  and  x.  What  we  proved  amounted  to  this,  that 


*  ? 


or  in  other  words  that  the  integral  of  the  sum  of  the  infinite  series 
1  —  t  +  t2  —  . . . ,  taken  between  the  limits  0  and  x,  is  equal  to  the  sum  of 
the  integrals  of  its  terms  taken  between  the  same  limits.  Another  way  of 
expressing  this  fact  is  to  say  that  the  operations  of  summation  from  0 
to  oo,  and  of  integration  from  0  to  x,  are  commutative  when  applied  to 
the  function  (— 1  )ntn,  i.e.  that  it  does  not  matter  in  what  order  they  are 
performed  on  the  function. 

Again,  in  §  216,  we  proved  that  the  differential  coefficient  of  the  expo¬ 
nential  function 

x2 

exp  x  =  1  +  x  +  —  +  ... 
is  itself  equal  to  expx,  or  that 


Dx 


^1  +  x  + 


x2 

DX1  +  Dxx  +  —  +  . . . ; 
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that  is  to  say  that  the  differential  coefficient  of  the  sum  of  the  series  is 
equal  to  the  sum  of  the  differential  coefficients  of  its  terms,  or  that  the 
operations  of  summation  from  0  to  oo  and  of  differentiation  with  respect 
to  x  are  commutative  when  applied  to  xn  jn\. 

Finally  we  proved  incidentally  in  the  same  section  that  the  function 
exp  a:  is  a  continuous  function  of  x,  or  in  other  words  that 

/  x2  \  f2  x2 

lim  [l  +  x  +  —  +  ...  =  1  +  £  +  777  H - =  lim  1  +  lim  x  +  lim  —  +  ...; 

X— >£  \  21  J  21  x— >(,  x— >£,  x— >■£  2! 

i.  e.  that  the  limit  of  the  sum  of  the  series  is  equal  to  the  sum  of  the  limits 
of  the  terms,  or  that  the  sum  of  the  series  is  continuous  for  x  =  £,  or  that 
the  operations  of  summation  from  0  to  oo  and  of  making  x  tend  to  £  are 
commutative  when  applied  to  xn fn\. 

In  each  of  these  cases  we  gave  a  special  proof  of  the  correctness  of 
the  result.  We  have  not  proved,  and  in  this  volume  shall  not  prove,  any 
general  theorem  from  which  the  truth  of  any  one  of  them  could  be  inferred 
immediately.  In  Ex.  XXXVII.  1  we  saw  that  the  sum  of  a  finite  number 
of  continuous  terms  is  itself  continuous,  and  in  §  113  that  the  differential 
coefficient  of  the  sum  of  a  finite  number  of  terms  is  equal  to  the  sum 
of  their  differential  coefficients;  and  in  §  160  we  stated  the  corresponding 
theorem  for  integrals.  Thus  we  have  proved  that  in  certain  circumstances 
the  operations  symbolised  by 


are  commutative  with  respect  to  the  operation  of  summation  of  a  finite 
number  of  terms.  And  it  is  natural  to  suppose  that,  in  certain  circum¬ 
stances  which  it  should  be  possible  to  define  precisely,  they  should  be 
commutative  also  with  respect  to  the  operation  of  summation  of  an  infi¬ 
nite  number.  It  is  natural  to  suppose  so:  but  that  is  all  that  we  have  a 
right  to  say  at  present. 

A  few  further  instances  of  commutative  and  non-commutative  opera¬ 
tions  may  help  to  elucidate  these  points. 
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(1)  Multiplication  by  2  and  multiplication  by  3  are  always  commuta¬ 
tive,  for 

2x3xx=3x2xx 

for  all  values  of  x. 

(2)  The  operation  of  taking  the  real  part  of  z  is  never  commutative 
with  that  of  multiplication  by  i,  except  when  z  —  0;  for 

i  x  R(x  +  iy)  =  ix,  R{i  x  (x  +  iy)}  =  —y. 

(3)  The  operations  of  proceeding  to  the  limit  zero  with  each  of  two 
variables  x  and  y  may  or  may  not  be  commutative  when  applied  to  a 
function  f(x,  y).  Thus 

limjlimfx  +  y)}  =  lim  x  —  0,  limilimfx  +  y)\  =  lim  y  =  0; 

x— ^0  y—t 0  x— >-0  y— >-0  x— >-0  y— >-0 

but  on  the  other  hand 


lim  ( lim  X-^-)  =  lim  *  =  lim  1  =  1, 

ir-s>0  V  j/ — 5-0  X  +  y  J  x->0  X  x^O 

lim  [  lim - —  )  =  lim  — —  =  lim(— 1)  =  — 1. 

y- s>0  X  +  y  J  0  y  0 


(4)  The  operations  . . . ,  lim  . . .  may  or  may  not  be  commutative. 

1  x — ^  1 

Thus  if  x  — >  1  through  values  less  than  1  then 


lim  log(l  +  x)  =  log  2, 

x— 


but  on  the  other  hand 


=  lim{(l  —  x)  +  (x  —  x2)  +  ...}  =  lim  1 

X^l  X— >1 

oo 

=  ^(1-1)  =0  +  0  +  0  +  ---  =0. 

1 


1, 
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The  preceding  examples  suggest  that  there  are  three  possibilities  with 
respect  to  the  commutation  of  two  given  operations,  viz.:  (1)  the  operations 
may  always  be  commutative;  (2)  they  may  never  be  commutative,  except 
in  very  special  circumstances ;  (3)  they  may  be  commutative  in  most  of  the 
ordinary  cases  which  occur  practically. 

The  really  important  case  (as  is  suggested  by  the  instances  which  we 
gave  from  Ch.  IX)  is  that  in  which  each  operation  is  one  which  involves  a 
passage  to  the  limit,  such  as  a  differentiation  or  the  summation  of  an  infinite 
series:  such  operations  are  called  limit  operations.  The  general  question  as 
to  the  circumstances  in  which  two  given  limit  operations  are  commutative 
is  one  of  the  most  important  in  all  mathematics.  But  to  attempt  to  deal 
with  questions  of  this  character  by  means  of  general  theorems  would  carry 
us  far  beyond  the  scope  of  this  volume. 

We  may  however  remark  that  the  answer  to  the  general  question  is 
on  the  lines  suggested  by  the  examples  above.  If  L  and  L'  are  two  limit 
operations  then  the  numbers  LL'  z  and  L' Lz  are  not  generally  equal,  in 
the  strict  theoretical  sense  of  the  word  ‘general’.  We  can  always,  by  the 
exercise  of  a  little  ingenuity,  find  z  so  that  LL' z  and  L'Lz  shall  differ  from 
one  another.  But  they  are  equal  generally,  if  we  use  the  word  in  a  more 
practical  sense,  viz.  as  meaning  ‘in  a  great  majority  of  such  cases  as  are 
likely  to  occur  naturally’  or  in  ordinary  cases. 

Of  course,  in  an  exact  science  like  pure  mathematics,  we  cannot  be 
satisfied  with  an  answer  of  this  kind;  and  in  the  higher  branches  of  mathe¬ 
matics  the  detailed  investigation  of  these  questions  is  an  absolute  necessity. 
But  for  the  present  the  reader  may  be  content  if  he  realises  the  point  of  the 
remarks  which  we  have  just  made.  In  practice,  a  result  obtained  by  assum¬ 
ing  that  two  limit-operations  are  commutative  is  probably  true:  it  at  any 
rate  affords  a  valuable  suggestion  as  to  the  answer  to  the  problem  under 
consideration.  But  an  answer  thus  obtained  must,  in  default  of  a  further 
study  of  the  general  question  or  a  special  investigation  of  the  particular 
problem,  such  as  we  gave  in  the  instances  which  occurred  in  Ch.  IX,  be 
regarded  as  suggested  only  and  not  proved. 

Detailed  investigations  of  a  large  number  of  important  double  limit 
problems  will  be  found  in  Bromwich’s  Infinite  Series. 
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(To  §  158  and  Chapter  IX) 

The  circular  functions 


The  reader  will  find  it  an  instructive  exercise  to  work  out  the  theory 
of  the  circular  functions,  starting  from  the  definition 

fx  dt 

(1)  y  =  y(x)  =  arctanx  =  /  - -.  Df.* 

v  '  V  '  J0  l +  t2 


The  equation  (1)  defines  a  unique  value  of  y  corresponding  to  every  real 
value  of  x.  As  y  is  continuous  and  strictly  increasing,  there  is  an  inverse 
function  x  =  x(y),  also  continuous  and  steadily  increasing.  We  write 


(2)  x  =  x(y)  =  tan  y. 
If  we  define  7 r  by  the  equation 

r°°  d+ 

(3)  '  ' 


2*  = 


1  +  t2’ 


then  this  function  is  defined  for  —  p t  <  y  <  b r. 


We  write  further 
(4)  cos  y  = 


\/l  +  x2' 


sm  y  = 


x 


vTT 


X 


2  ’ 


Df. 


Df. 


Df. 


where  the  square  root  is  positive;  and  we  define  cos  y  and  sin  y,  when  y  is 
—  \n  or  1 7 r,  so  that  the  functions  shall  remain  continuous  for  those  values 
of  y.  Finally  we  define  cos  y  and  sin y,  outside  the  interval  [—\i r,  |7r],  by 


tan(?/  +  7r)  =  tan  y, 

(5)  cos  (y  +  7r)  =  —  cos  y,  Df. 

sin  {%/  +  7r)  =  —  sin  y. 

*  These  letters  at  the  end  of  a  line  indicate  that  the  formulae  which  it  contains  are 
definitions. 
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We  have  thus  defined  cos  y  and  sin  y  for  all  values  of  y ,  and  tan  y  for 
all  values  of  y  other  than  odd  multiples  of  f  7T.  The  cosine  and  sine  are 
continuous  for  all  values  of  y ,  the  tangent  except  at  the  points  where  its 
definition  fails. 

The  further  development  of  the  theory  depends  merely  on  the  addition 
formulae.  Write 

+  x2 

x  =  1 - ’ 

1  -  XxX2 

and  transform  the  equation  (1)  by  the  substitution 

X\  +  U  t  —  X\ 

t  =  - - ,  u  =  - - . 

1  —  X\U  1  +  X\ t 

We  find 

X\  +  x2  fX2  du  fXl  du  n  du 
arc  tan  ^  J  ^  1  +  u2  Jq  1  +  v2  +  J0  1  +  u2 

=  arc  tan  x\  +  arctan^- 


From  this  we  deduce 


(6) 


tan(j/i  +  y2) 


tan  yi  +  tan  y2 
1  —  tan  yi  tan  y2  ’ 


an  equation  proved  in  the  first  instance  only  when  y\ ,  y2 ,  and  y\  +y2  lie  in 
[— ^vr,  \,n\,  but  immediately  extensible  to  all  values  of  y\  and  y2  by  means 
of  the  equations  (5). 

From  (4)  and  (6)  we  deduce 


cos(r/i  +  y2)  =  ±(cos?/i  cos y2  -  siny,  siny2). 


To  determine  the  sign  put  y2  =  0.  The  equation  reduces  to  cos  y\  =  ±  cos  yi, 
which  shows  that  the  positive  sign  must  be  chosen  for  at  least  one  value 
of  y2,  viz.  y2  =  0.  It  follows  from  considerations  of  continuity  that  the 
positive  sign  must  be  chosen  in  all  cases.  The  corresponding  formula  for 
sin(yi  +  y2)  may  be  deduced  in  a  similar  manner. 
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The  formulae  for  differentiation  of  the  circular  functions  may  now  be 
deduced  in  the  ordinary  way,  and  the  power  series  derived  from  Taylor’s 
Theorem. 

An  alternative  theory  of  the  circular  functions  is  based  on  the  theory 
of  infinite  series.  An  account  of  this  theory,  in  which,  for  example,  cos  a:  is 
defined  by  the  equation 


cos  x  =  1  — 


2!  4! 


will  be  found  in  Whittaker  and  Watson’s  Modern  Analysis  (Appendix  A). 


APPENDIX  IV 


The  infinite  in  analysis  and  geometry 

Some,  though  not  all,  systems  of  analytical  geometry  contain  ‘infinite’ 
elements,  the  line  at  infinity,  the  circular  points  at  infinity,  and  so  on.  The 
object  of  this  brief  note  is  to  point  out  that  these  concepts  are  in  no  way 
dependent  upon  the  analytical  doctrine  of  limits. 

In  what  may  be  called  ‘common  Cartesian  geometry’,  a  point  is  a  pair 
of  real  numbers  (x,y).  A  line  is  the  class  of  points  which  satisfy  a  linear 
relation  ax  +  by  +  c  =  0,  in  which  a  and  b  are  not  both  zero.  There  are  no 
infinite  elements,  and  two  lines  may  have  no  point  in  common. 

In  a  system  of  real  homogeneous  geometry  a  point  is  a  class  of  triads  of 
real  numbers  ( x ,  y,  z ),  not  all  zero,  triads  being  classed  together  when  their 
constituents  are  proportional.  A  line  is  a  class  of  points  which  satisfy  a 
linear  relation  ax  +  by  +  cz  =  0,  where  a,  b ,  c  are  not  all  zero.  In  some  sys¬ 
tems  one  point  or  line  is  on  exactly  the  same  footing  as  another.  In  others 
certain  ‘special’  points  and  lines  are  regarded  as  peculiarly  distinguished, 
and  it  is  on  the  relations  of  other  elements  to  these  special  elements  that 
emphasis  is  laid.  Thus,  in  what  may  be  called  ‘real  homogeneous  Cartesian 
geometry’,  those  points  are  special  for  which  z  =  0,  and  there  is  one  special 
line,  viz.  the  line  z  =  0.  This  special  line  is  called  ‘the  line  at  infinity’. 

This  is  not  a  treatise  on  geometry,  and  there  is  no  occasion  to  develop 
the  matter  in  detail.  The  point  of  importance  is  this.  The  infinite  of 
analysis  is  a  ‘limiting’  and  not  an  ‘actual’  infinite.  The  symbol  ‘oo’  has, 
throughout  this  book,  been  regarded  as  an  ‘incomplete  symbol’,  a  symbol 
to  which  no  independent  meaning  has  been  attached,  though  one  has  been 
attached  to  certain  phrases  containing  it.  But  the  infinite  of  geometry  is  an 
actual  and  not  a  limiting  infinite.  The  ‘line  at  infinity’  is  a  line  in  precisely 
the  same  sense  in  which  other  lines  are  lines. 

It  is  possible  to  set  up  a  correlation  between  ‘homogeneous’  and 
‘common’  Cartesian  geometry  in  which  all  elements  of  the  first  system, 
the  special  elements  excepted,  have  correlates  in  the  second.  The  line 
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ax  +  by  +  cz  =  0,  for  example,  corresponds  to  the  line  ax  +  by  +  c  =  0. 
Every  point  of  the  first  line  has  a  correlate  on  the  second,  except  one,  viz. 
the  point  for  which  z  —  0.  When  (x,  y,  z )  varies  on  the  first  line,  in  such 
a  manner  as  to  tend  in  the  limit  to  the  special  point  for  which  z  —  0,  the 
corresponding  point  on  the  second  line  varies  so  that  its  distance  from 
the  origin  tends  to  infinity.  This  correlation  is  historically  important,  for 
it  is  from  it  that  the  vocabulary  of  the  subject  has  been  derived,  and  it  is 
often  useful  for  purposes  of  illustration.  It  is  however  no  more  than  an 
illustration,  and  no  rational  account  of  the  geometrical  infinite  can  be 
based  upon  it.  The  confusion  about  these  matters  so  prevalent  among 
students  arises  from  the  fact  that,  in  the  commonly  used  text  books  of 
analytical  geometry,  the  illustration  is  taken  for  the  reality. 
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